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features. Anatomic similarity is a reflection of 
developmental, physiological and biochemical similarities 
shared by these insects, in particular, the principles 
governing the role of insect ecdysteroid-hormone 
receptors in DyosophUfr development, as described above, 
likely are shared by many different types of insects. 

As one weapon against the destruction of cultivated 
crops by insects, organic molecules with pesticidal 
properties are used commonly in attempts to eliminate 
insect populations. However, the ecological side effects 
of these pesticides, due in part to their broad activity 
and. lack of specificity, and also in part to the fact 
that some of these pesticides are not easily 
biodegradable, significantly affect populations of both 
insects and other species of animals. Some of these 
populations may be advantageous from an ecological or 
other perspective. Furthermore, as the insect 
populations evolve to minimize the effects of the applied 
pesticides, greater amounts of pesticides must be 
applied, causing significant direct and indirect effects 
on other animals, including humans. Thus, an important 
need exists for both highly specific and highly active 
pesticides which are biodegradable. Novel insect 
hormones which, like the ecdysteroids, act by complexing 
with insect members of the steroid receptor superfamily 
to control insect development, are likely candidates for 
pesticides with these desirable properties. 

The use of insect hormones may also have other 
important applications. Many medically and commercially 
important proteins can be produced in a usable form by 
genetically engineered bacteria. However, many expressed 
proteins are processed incorrectly in bacteria and are 
preferably produced by genetically engineered eucaryotic 
cells. Typically, yeast cells or mammalian tissue- 
culture cells are used. Because it has been observed 
that protein processing of foreign proteins in yeast 
cells is also frequently inappropriate, mammalian 
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cultured cells have become the central focus for the 
production of many proteins. It is commonly known that 
the production of large amounts of foreign proteins makes 
these cells unhealthy, which may affect adversely the 
5 yield of the desired protein. This problem may be 

circumvented, in part, by using an inducible expression 
system; In such a system, the cells are engineered so 
that they do not express the foreign protein until an 
inducing agent is added to the growth medium. In this 

10 way, large quantities of healthy cells can be produced 

and then induced to produce large amounts of the foreign 
protein. Unfortunately, in the presently available 
systems, the inducing agents themselves, such as metal 
ions or high temperature, adversely affect the cells, 

15 thus again lowering the yield of the desired foreign 

protein the cells produce. A need therefore exists for 
the development of benign inducing factors for efficient 
production of recombinant proteins. Such factors could 
also prove invaluable for the therapy of human patients 

20 suffering from inability to produce particular proteins, 

treatment with these factors controlling both the timing 
and the abundance of the protein produced in the affected 
individual . 

The hormones that complex with mammalian or other 
25 vertebrate members of the steroid receptor superfamily 

are unlikely candidates as such benign factors because 
they would alter the expression of many target genes in 
cells bearing these receptors, thereby adversely 
affecting the host cells. 
30 For these and other reasons, obtaining steroid 

receptors or nucleic acid information about them has been 
a goal of researchers for several years. Unfortunately, 
efforts have been unsuccessful despite a significant 
investment of resources. The absence of information on 
35 the structure and molecular biology of steroid receptors 

has significantly hindered the ability to produce such 
products . 
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Thus, there exists a need for detailed sequence 
information on insect members of the steroid receptor 
super family, and the genes that encode these receptors 
and for resulting reagents. Reagents are provided which 
are useful in finding new molecules which may act as 
agonists or antagonists of natural insect members of the 
steroid receptor superfamily, or as components of systems 
for highly specific regulation of recombinant proteins in 
mammalian cells. 
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SUMMARY OF THE INVENTION 
In accordance with the present invention, isolated 
recombinant nucleic acids are provided which, upon 
expression, are capable of coding for other than a native 
vertebrate steroid receptor or fragment thereof. 
These nucleic acids typically comprise a segment having a 
sequence substantially homologous to one or more coding 
regions of domains A, B, D, E, or F from an insect 
steroid preceptor superfamily member gene having steroid 
binding domain homology, e.g., EcR, DHR3 , E75A, or E75B. 
Preferably, the .nucleic acids encode a polypeptide 
capable of binding to a ligand for an insect steroid 
receptor superfamily member and are capable of 
hybridizing to an insect steroid receptor superfamily 
25 member gene segment under selective hybridization 

conditions, usually stringent hybridization conditions. 
Mammalian cells transformed with the nucleic acids are 
also provided. 

In another embodiment, isolated recombinant nucleic 
acids are included that have sequence exhibiting identity 
over about 20 nucleotides of a coding segment of an 
insect steroid receptor superfamily member having steroid 
binding domain homology. The nucleic acids can be 
transformed into cells to express a polypeptide which 
binds to a control element responsive to a ligand of an 
insect steroid receptor superfamily. 
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Alternatively, an isolated DNA molecule is provided 
comprising a DNA sequence capable of binding to an insect 
steroid receptor superfamily member other than 20-OH 
ecdysone receptor, such as DHR3 , E75A, or E75B. The DNA 
5 sequence will generally be present in an expression 

vector and promote transcription of an operably linked 
sequence (e.g., encoding a polypeptide) in response to 
binding by an insect steroid receptor superfamily member. 
Cells comprising the nucleic acids are provided, as are 

10 cells expressing the polypeptides. In certain 

embodiments, non-insect cells will be used, including 
mammalian cells. Also contemplated are recombinant 
nucleic acids comprising a controlling element responsive 
to a ligand, e.g., a ligand which binds to an insect 

15 steroid receptor superfamily member ligand responsive 

controlling element, a non-heat shock promoter sequence 
(e.g., an alcohol dehydrogenase promoter) and a sequence 
comprising a reporter gene. Usually the controlling 
element Will operate to make transcription of the 

20 reporter gene responsive to the presence of the ligand. 

Additional embodiments of the present invention 
include polypeptides comprising an insect steroid 
receptor superfamily member or fragment thereof, wherein 
such polypeptide is substantially free of naturally- 

25 associated insect cell components and exhibits a 

biological activity characteristic of an insect steroid 
receptor superfamily member with a hormone binding 
domain. Preferably, the insect steroid receptor 
superfamily member or fragment thereof also comprises a 

30 DNA binding domain and the polypeptide is capable of 

binding to a hormone analogue selected from the group 
consisting of an insect hormone, an insect hormone 
agonist and an insect hormone antagonist. The 
polypeptide can comprise a zinc-finger domain and usually 

35 is capable of binding to a DNA controlling element 

responsive to an insect hormone. As desired, the 
polypeptide will be fused to a second polypeptide, 
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typically a heterologous polypeptide which comprises a 
second steroid receptor superfamily member. Cells, often 
mammalian cells, comprising the protein are provided. 

Fragments of such polypeptides can have a sequence 
substantially homologous to consensus El, E2 or E3 region 
sequences. By way of example, a preferred fragment has a 
sequence comprising: 

a segment at least about 25% homologous to a 
consensus El region sequence; 

a segment at least about 30% homologous to a 

consensus E2 region sequence; and 
a segment at least about 30% homologous to a 

consensus E3 region sequence. 
The polypeptides of the present invention have a 
15 variety of utilities. For example, a method for 

selecting DNA sequences capable of being specifically 
bound by an insect steroid receptor superfamily member 
can comprise the steps of screening DNA sequences for 
binding f o such polypeptides and selecting DNA sequences 
exhibiting such binding. Alternatively, methods for 
selecting ligands, e.g., ecdysteroid analogues, specific 
for binding to a hormone binding domain of an insect 
steroid receptor superfamily member can comprise the 
steps of screening compounds for binding to one or more 
superfamily members and selecting compounds exhibiting 
specific binding to the members. 
Also included are methods for modulating insect 
physiology or development (e.g., killing) comprising the 
steps of screening compounds for binding to an insect 
steroid receptor superfamily member, selecting compounds 
exhibiting said binding and administering the ligand to 
an insect. 

Additionally provided are methods for selecting 
ligands specific for binding to a ligand binding domain 
of an insect steroid receptor superfamily member 
comprising combining: 
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(i) a fusion polypeptide which comprises a 

ligand binding domain functionally 
linked to a DNA binding domain of a 
second steroid receptor superfamily 
member ; and 

(ii) a second nucleic acid sequence encoding a 

second polypeptide, wherein 
expression of the second nucleic acid 
sequence is responsive to binding by 
the DNA binding domain; 
screening compounds for an activity of inducing 
expression of the second polypeptide; and 

selecting those compounds which do so. 
This will often be performed in a cell, e.g., with cells 
transformed with DNA encoding a fusion protein. This 
method allows selction of analogues which are useful in 
modulating insect physiology or development. 

Also provided are methods for producing a 
polypeptide comprising the steps of: 

selecting a cell, typically a mammalian or plant 
cell which is substantially insensitive to 
exposure of an insect steroid receptor 
superfamily ligand; 
introducing into said cell: 

(i) a receptor for the ligand; and 
(ii) a nucleic acid sequence encoding the 
polypeptide, the nucleic acid 
sequence operably linked to a 
controlling element responsive 
to presence of the selected 
ligand, wherein a transformed 
cell is produced; and 
exposing the transformed cell to the ligand. 
Usually the cell will be a mammalian cell, and will 
sometimes be introduced into a whole organism, e.g., a 
plant or animal. 
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Kits for the determination of expression levels of 
the nucleic acids and proteins provided herein are made 
available. Typically, the kit will have at least one 
compartment which contains a reagent which specifically 
binds to the desired target molecule, e.g., ligand 
analogues, receptors, or nucleic acids. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1. pMTEcR, a Cu 2+ -inducible EcR expression plasmid. 
The P^., ecR ORF and Act 5c poly a elements are defined in 
Experimental Example III, part A. The HYG r ORF confers 
hygromycin resistance and is under control of the 
promoter in the LTR of Drpsophjia transposable elements 
SSEia. The SV40 intron/poly A element provides an intron 
for a possible splicing requirement, as well as a 
polyadenylation/cleavage sequence for the HYG r ORF mRNA 
The pAT153 DMA derives from a bacterial plasmid. 
Figure 2. The ecdysone-inducible pEcRE/Adh/flgal reporter 
plasmid., see the text of Experimental Example in, part 
B, for the construction of this plasmid and the 
definitions of all symbols (except the SV40 splice and 
poly A) which are defined in the figure legend. 
Figure 3. The constitutive EcR expression plasmid, 
pActEcR. The construction of this plasmid and the 
definition of the symbols are given in Experimental 
Example III, part B. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
The present invention provides novel isolated 
nucleic acid sequences encoding polypeptide products 
exhibiting the structure and/or activities of insect 
members of the steroid receptor superfamily. Having 
elucidated the structures of these insect steroid 
receptors from their genes, the separate ligand-binding 
domains and DNA-binding domains are used individually or 
in combination to screen for new ligands or DNA sequences 
which bind to these domains. Thus, for example, by 
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binding to promoter sequences incorporating a DNA binding 
site, these receptors will usually control expression of 
reporter genes for which sensitive assays exist. Or, the 
hormone-binding domains serve as reagents for screening 
5 for agonists or antagonists of steroid receptor 

superfamily members. Either new classes of molecules, or 
selected modifications of known ligands will be screened 
for receptor binding. New ligands obtained in this way 
find use as highly specific and highly active naturally 

10 occurring pesticides. Alternatively, structural 

information about interactions between the ligand and 
binding domains directs methods for mutagenizing or 
substituting particular residues in the binding domains, 
thereby providing for altered binding specificity. Thus, 

15 inter alia , the present invention provides for screening 

for new ligand molecules, for the design of new ligand - 
binding domain interactions, for producing novel chimeric 
steroid receptor superfamily members and for generating 
new combJLnations of ligands and binding domains. 

20 The present invention also provides for the 

isolation or identification of new steroid hormone- 
responsive elements and associated genes. By appropriate 
operable linkage of selected sequences to DNA controlling 
elements which are responsive to binding by the DNA- 

25 binding domains of steroid receptor superfamily members, 

new regulatory combinations result. The present 
invention further provides for the design of either a 
binding domain in a member of the insect steroid receptor 
superfamily that will recognize given DNA sequences, or 

30 conversely for the modification of DNA sequences which 

will bind to particular receptor DNA-binding domains. 
Both the DNA-binding domain of a superfamily-member 
polypeptide and its DNA recognition sequence can be 
coordinately modified to produce wholly new receptor-DNA 

3 5 interactions . 

In an alternative embodiment, a DNA-binding sequence 
recognized by a selected receptor will be operably linked 
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to a desired genetic sequence for inducible expression. 
Thus, upon administration of a ligand specific for that 
selected receptor, the genetic sequence is appropriately 
regulated. Expression systems are constructed that are 
responsive to administration of insect steroid receptor 
superfamily-specific ligands. By identifying and 
isolating new members of the insect steroid receptor 
superfamily, new, useful regulatory reagents become 
available, both hormones and controlling elements. 

In another embodiment, highly regulatable expression 
of a gene is achieved by use of regulatory elements 
responsive to ligands specific to the superfamily 
members. If transformed cells are grown under conditions 
where expression is repressed or not induced, the cells 
will grow to higher densities and enjoy less stressful 
conditions. Upon reaching high density, the regulatory 
ligand molecule is added to cause high expression. 
Selected cells otherwise insensitive to the inducing 
ligand will not be affected by exposure to the ligand 
used to regulate expression. This provides a means both 
for highly efficient regulatable expression of genes, and 
for introduction of these genes into intact organisms. 

In accordance with specific embodiments of the 
present invention, nucleic acid sequences encoding 
portions of insect steroid hormone receptor superfamily 
members have been elucidated. DNA encoding four 
different members of the Drosophila steroid receptor 
superfamily have been characterized: (1) the 20-OH 
ecdysone receptor, also called the ecdysone receptor 
(EcR) , for which a full-length encoding sequence has been 
determined; (2) Drosophila hormone receptor 3 (DHR3) , a 
protein with sequence homology to various steroid 
receptor superfamily members; (3 and 4) E75A and E75B, 
closely related proteins, encoded by segments of the same 
gene, and each possessing sequence homology to other 
steroid receptor superfamily members. 
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The DNA sequences encoding each of these members of 
the insect steroid receptor superfamily provide probes 
for screening for homologous nucleic acid sequences, both 
in Drosophila and other sources. This screening allows 
5 isolation of homologous genes from both vertebrates and 

invertebrates. Production of large amounts of the 
encoded proteins is effected by inserting those sequences 
into expression systems . 

The EcR, DHR3, E75A, and E75B genes are each linked 

10 to similar DNA sequences which likely function as 

controlling, or regulatory elements which are responsive 
to insect steroids. The present invention provides for 
the isolation of these hormone-responsive control 
elements, and for their use in regulating gene 

15 expression. One embodiment of a DNA construct comprises: 

(1) multiple copies of an insect steroid receptor 
superfamily controlling element linked to (2) a minimal 
gene promoter, preferably not a heat shock gene promoter, 
which provides highly inducible expression of (3) an 

20 operably linked gene. This construct provides a very 

sensitive assay for the presence of the controlling 
molecule of the receptor. 

Another aspect of the present invention involves 
cells comprising: (1) isolated recombinant gene segments 

25 encoding biologically active fragments of insect steroid 

receptor superfamily proteins; (2) DNA sequences which 
bind insect steroid receptors, e.g., the elements 
involved in hormone-responsive control; or (3) modified 
receptor proteins. Transformed cells are understood to 

30 include their progeny. In particular, the present 

invention provides for a system whereby expression of 
polypeptides is responsive to steroid induction. For 
instance, a system which expresses a desired protein in 
response to exposure to ecdysone analogues is constructed 

35 by operably linking a promoter having an ecdysone- 

responsiye enhancer to a peptide encoding segment. 
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The present invention also provides insect steroid 
receptor proteins substantially free from naturally- 
associated insect cell components. Such receptors will 
typically be either full-length proteins, functional 
fragments, or fusion proteins comprising segments from an 
insect steroid receptor protein fused to a heterologous, 
or normally non-contiguous, protein domain. 

The present invention further provides a number of 
methods for utilizing the subject receptor proteins. One 
aspect of the present invention is a method for selecting 
new hormone analogues. The isolated hormone-binding 
domains specifically bind hormone ligands, thereby 
providing a means to screen for new molecules possessing 
the property of binding with high affinity to the ligand- 
binding region. Thus, a binding domain of an insect 
steroid receptor superfamily member will be used as a 
reagent to develop a binding assay. On one level, the 
binding domains are useful as affinity reagents for a 
batch or a column selective process, i.e., to selectively 
retain ligands which bind. Alternatively, a functional 
assay is preferred for its greater sensitivity to ligand- 
binding, whether a direct binding assay or an indirect 
assay in which binding is linked to an easily assayed 
function. For example, by operable linkage of an easily 
assayable reporter gene to a controlling element 
responsive to binding by an insect steroid receptor 
superfamily member, in which ligand-binding induces 
protein synthesis, an extremely sensitive assay for the 
presence of a ligand or of a receptor results. Such a 
construct useful for assaying the presence of 20-OH 
ecdysone is described below. This construct is useful 
for screening for agonists or antagonists of the 20-OH 
ecdysone ligand. 

In particular, this method allows selecting for 
ligands which bind to an "orphan" receptor, i.e., a 
receptor whose ligand is unknown. Binding domains for 
"unknown" ligands will often originate from either newly 
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identified insect steroid receptor superfamily members, 
or from mutagenesis. A hybrid receptor will be created 
with a ligand-binding domain and DNA-binding domain from 
different sources. For example, a hybrid receptor 
5 between a putative binding domain and a known DNA-binding 

domain would allow screening for ligands. An "orphan 
receptor" binding domain will be functionally linked to a 
known DNA-binding domain which will control a known 
reporter gene construct whose expression will be easily 
10 detected. This system for ligand-receptor binding 

provides an extremely sensitive assay for ligand-receptor 
interactions • 

Alternatively, the recognition of important features 
of tertiary structure and spatial interactions between a 

15 ligand-binding domain from an insect steroid receptor 

superfamily member and its ligand will allow selection of 
new combinations of ligand-binding domains with ligands. 
Either method provides for selecting unusual ligands 
which specifically bind a modified polypeptide-binding 

20 domain of a receptor. This approach allows selection of 

novel steroid hormone analogues which exhibit modified 
specificity for binding to a subgroup of steroid 
receptors . 

The present invention also provides for new and 
useful combinations of the various related components: 
the recombinant nucleic acid sequences encoding the 
polypeptides, the polypeptide sequences, and the DNA 
sites to which the receptors bind (i.e., the regulatory, 
or control, elements). For instance, fusing portions of 
30 nucleic acid sequences encoding peptides from different 

sources will provide polypeptides exhibiting hybrid 
properties, e.g., unusual control and expression 
characteristics. In particular, hybrid receptors 
comprising segments from other members of the 
35 superfamily, or from other sources, will be made. 

combining an insect steroid receptor-responsive enhancer 
segment with a different polypeptide encoding segment 
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will produce a steroid-responsive expression system for 
that polypeptide. 

The isolation of insect steroid receptors provides 
for isolation or screening of new ligands for receptor 
binding. Some of these will interfere with, or disrupt, 
normal insect development. These reagents will allow the 
user to accelerate or decelerate insect development, for 
instance, in preparing sterile adults for release. 
Alternatively, a delay or change in the timing of 
development will often be lethal or will dramatically 
modify the ability of an insect to affect an agricultural 
crop. Thus, naturally occurring biodegradable and highly 
active molecules able to disrupt the timing of insect 
development will result. 

Furthermore, these polypeptides provide the means by 
which have been raised antibodies possessing specificity 
for binding to particular steroid receptor classes. 
Thus, reagents will be produced for determining, 
qualitatively or quantitatively the presence of these or 
homologous polypeptides. Alternatively, these antibodies 
will be used to separate or purify receptor polypeptides. 

Transcription sequences o f insect steroid T*nap i-n r 
superfamjlY members 

The ecdysone receptor gene is a member of the 
steroid and thyroid hormone receptor gene superfamily, a 
group of ligand-responsive transcription factors. See , 
Evans (1988) Science 240:889-895; and Segraves, Molecular 
and genetic Analysis of thg E 7 5 Ecd Y sonfi-R esPQ n S ivg r^P 
of Drosophila mel anoaastar (Ph.D. thesis, Stanford 
University 1988) , both of which are hereby incorporated 
herein by reference for all purposes. These receptors 
show extensive sequence similarity, especially in their 
"zinc finger" DNA-binding domains, and also in a ligand 
35 (° r bormone or steroid) binding domain. Modulation of 

gene expression apparently occurs in response to binding 
of a receptor to specific control, or regulatory, DNA 
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elements. The cloning of receptor cDNAs provides the 
first opportunity to study the molecular bases of steroid 
action. The steroid receptor superfamily is a class of 
receptors which exhibit similar structural and functional 

5 features. While the term insect is used herein, it will 

be recognized that the sane methods and molecules will be 
derived from other species of animals, in particular, 
those of the class Insecta . or, more broadly, members of 
the phylum Arthropoda which use ecdysteroids as hormones. 

10 Members of the insect steroid receptor superfamily are 

characterized by functional ligand-binding and DNA 
binding domains, both of which interact to effect a 
change in the regulatory state of a gene operably linked 
to the DNA-binding site of the receptor. Thus, the 

15 receptors of the insect steroid receptor superfamily seem 

to be ligand-responsive transcription factors. The 
receptors of the present invention exhibit at least a 
hormone-binding domain characterized by sequence homology 
to particular regions, designated El, E2 and E3. 

20 The members of the insect steroid receptor 

superfamily are typically characterized by structural 
homology of particular domains, as defined initially in 
the estrogen receptor. Specifically, a DNA-binding 
domain, C, and a ligand-binding domain, E, are separated 

25 and flanked by additional domains as identified by Krust 

et al. (1986) EMBO J. 5:891-897, which is hereby 
incorporated herein by reference. 

The C domain, or zinc-finger DNA-binding domain, is 
usually hydrophilic, having high cysteine, lysine and 

30 arginine content ~ a sequence suitable for the required 

tight DNA binding. The E domain is usually hydrophobic 
and further characterized as regions El, E2 and E3. The 
ligand-binding domains of the present invention are 
typically characterized by having significant homology in 

35 sequence and structure to these three regions. Amino 

proximal to the C domain is a region initially defined as 
separate A and B domains. Region D separates the more • 
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conserved domains C and E. Region D typically has a 
hydrophilic region whose predicted secondary structure is 
rich in turns and coils. The F region is carboxy 
proximal to the E region (see, Krust et al. , supra 1 . 

The ligand-binding domain of the members of the 
insect steroid receptor superfamily is typically 
carboxyl-proximal, relative to a DNA-binding domain 
described below. See, Evans (1988) Science 240:889-895. 
The entire hormone-binding domain is typically between 
about 200 and 250 amino acids but is potentially shorter. 
This domain has the subregions of high homology, 
designated the El, E2 and E3 regions, gee,, e.g., 
Table 4. 

The El region is 19 amino acids long with a 
consensus sequence AKX(L/I)PGFXXLT(L/I) (D/E) DQITLL, where 
X represents any amino acid and the other letters are the 
standard single-letter code. Positions in parentheses 
are alternatives. Typically, members of the insect 
steroid receptor superfamily will have at least about 
five matches out of the sixteen assigned positions, 
preferably at least about nine matches, and in more 
preferred embodiments, at least about ten matches. 
Alternatively, these insect steroid receptor superfamily 
members will have, homologous sequences exhibiting at 
least about 25% homology, normally at least about 30% 
homology, more normally at least about 35% homology, 
generally at least about 40% homology, more generally at 
least about 45% homology, typically at least about 50% 
homology, more typically at least about 55% homology, 
usually at least about 60% homology, more usually at 
least about 70% homology, preferably at least about 80% 
homology, and more preferably at least about 90% homology 
at positions assigned preferred amino acids. 

The E2 region is a 19 amino-acid segment with a 
35 consensus sequence: 

E(F/Y) (A/V) (L/C) (L/M)KA(I/L) (V/L)L(L/I) 
(N/S) (S/P)D(P/-) (R/K) (P/D)GL , 
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where - represents an optional absence of an amino acid. 
Typically, an insect steroid receptor superfamily member 
will exhibit at least about six matches, preferably at 
least about eight matches and more preferably at least 
5 about nine matches. Alternatively, E2 sequences of 

insect steroid receptor superfamily members exhibit at 
least about about 25% homology, normally at least about 
30% homology, more normally at least about 35% homology, 
generally at least about 40% homology, more generally at 

10 least about 45% homology, typically at least about 50% 

homology, more typically at least about 55% homology, 
usually at least about 60% homology, more usually at 
least about 70% homology, preferably at least about 80% 
homology, and more preferably at least about 90% homology 

15 at positions assigned preferred amino acids* 

The E3 region is a 12 amino-acid segment with a 
consensus sequence 

LXKLLXXLPDLR . 
The insefat steroid receptor superfamily members will 

20 typically show at least about four matches out of the 

nine assigned preferences in the E3 region, preferably at 
least about five matches and more preferably at least 
about six matches. Alternatively, over the assigned 
positions, members of the insect steroid receptor 

25 superfamily will exhibit at least about 25% homology, 

normally at least about 30% homology, more normally at 
least about 35% homology, generally at least about 40% 
homology, more generally at least about 45% homology, 
typically at least about 50% homology, more typically at 

30 least about 55% homology, usually at least about 60% 

homology, more usually at least about 70% homology, 
preferably at least about 80% homology, and more 
preferably at least about 90% homology at positions 
assigned preferred amino acids . 

35 in preferred embodiments, the insect steroid 

receptor superfamily members will exhibit matching of at 
least about five positions in an El region, at least 
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about six positions in an E3 region and at least about 
four positions in an E3 region- The El, E2, and E3 
regions are defined, e.g., in Table 4, 

The DNA-binding domain of these insect steroid 
5 receptor super family members is characterized by a "zinc 

fingers" motif. See, Evans (1988) Science 240:889-895. 
The domain is typically amino proximal to the ligand, or 
hormone, binding site. Typically, the DNA-binding domain 
of the insect steroid receptor superfamily members is 

10 characterized by clustering of basic residues, a 

cysteine-rich composition and sequence homology. See . 
Evans, R. M. (1988) Science 240:889-89; and Experimental 
section below. Significant polypeptide sequence homology 
among superfamily members exists . The insect steroid 

15 receptor superfamily members will exhibit at least about 

30% homology in the 67 ± 1 amino acid region of this 
domain, normally at least about 40% homology, usually at 
least about 45% homology, and preferably at least about 
55% homplogy. 

20 Steroids are derivatives of the saturated 

tetracyclic hydrocarbon perhydrocyclopentanophenanthrene. 
Among the molecules in the group ^steroids" are: the bile 
acids, e.g., cholic acid, and deoxycholic acid; the 
adrenocortical steroids, e.g., corticosterone and 

25 aldosterone; the estrogens, e.g., estrone and 6- 

estradiol; the androgens, e.g., testosterone and 
progesterone; and the ecdysteroids . The terms steroid or 
steroid hormones are used interchangeably herein and are 
intended to include all steroid analogues. Typically, 

30 steroid analogues are molecules which have minor 

modifications of various peripheral chemical groups. 
See, Koolman (ed.) (1989), cited, above, for details on 
ecdysteroids . 

Although ligands for the insect steroid receptor 

35 superfamily members have historically been characterized 

as steroids, the term "steroid" as in "insect steroid 
receptor superfamily" is not meant only literally. The ■ 
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use of "steroid" has resulted from a historical 
designation of members of a group recognized initially to 
include only molecules having specific defined molecular 
structures. However, this limitation is no longer 
applicable since functions are no longer only correlated 
with precise structures. Thus, there will be members of 
the insect steroid receptor superfamily, as defined 
herein, whose ligand-binding specificities are not 
directed to classically defined "steroids." Typically, 
the ligands for members of the insect steroid receptor 
superfamily are lipophilic molecules which are structural 
analogues of steroid molecules. 

The term ligand is meant to refer to the molecules 
that bind the domain described here as the "hormone- 
binding domain." Also, a ligand for an insect steroid 
receptor superfamily member is a ligand which serves 
either as the natural ligand to which the member binds, 
or a functional analogue which serves as an agonist or 
antagonist. The classical definition of "hormone" has 
been defined functionally by physiologists, see, e.g., 
Guyton, Textbook of Medical Physiology, Saunders, 
Philadelphia. The functional term "hormone" is employed 
because of historic usage, but is meant to apply to other 
chemical messengers used to communicate between cell 
types. Recently the distinction between hormones and 
neurotransmitters has been eroded as various peptide 
neurotransmitters have been shown to exhibit properties 
of classically defined hormones. These molecules are 
typically used in intercellular signal transduction, but 
are not limited to those molecules having slow or 
systemic effects, or which act at remote sites. 

Substantial homology in the nucleic acid context 
means either that the segments, or their complementary 
strands, when compared, are identical when optimally 
aligned, with appropriate nucleotide insertions or 
deletions, in at least about 40% of the residues, 
generally at least about 45%, more generally at least 
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about 50%, normally at least about 55%, more normally at 
least about 60%, typically at least about 65%, more 
typically at least about 70%, usually at least about 75%, 
more usually at least about 80%, preferably at least 
about 85%, and more preferably at least about 95% of the 
nucleotides. Alternatively, substantial homology exists 
when the segments will hybridize under selective 
hybridization conditions, to a strand or its complement,, 
typically using a sequence derived from Table 1, 2 or 3. 
Selectivity of hybridization exists when hybridization 
occurs which is more selective than total lack of 
specificity. Normally, selective hybridization will 
occur when there is at least about 55% homology over a 
stretch of at least about 14 to 25 nucleotides, generally 
at least about 65%, typically at least about 75%, usually 
at least about 85%, preferably at least about 90%, and 
more preferably at least about 95% or more, see , 
Kanehisa, M. (1984), Nucleic A cids Res, 12:203-213, which 
is incorporated herein by reference. Stringent 
hybridisation conditions will include salt concentrations 
of less than about 2.5 M, generally less than about 1.5 
M, typically less than about 1 M, usually less than about 
500 mM, and preferably less than about 200 mM. 
Temperature conditions will normally be greater than 
20 *c, more normally greater than about 25 'C, generally 
greater than about 30 *c, more generally greater than 
about 35 'C, typically greater than about 40 *C, more 
typically greater than about 45 -C, usually greater than 
about 50 -C, more usually greater than about 55 *C, and in 
particular embodiments will be greater than 60 "C, even as 
high as 80 *C or more. As other factors may significantly 
affect the stringency of hybridization, including, among 
others, base composition and size of the complementary 
strands, presence of organic solvents and extent of base 
mismatching, the combination of parameters is more 
important than the absolute measure of any one. 
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A gene for an insect steroid receptor superfamily 
member gene includes its upstream (e.g., promoter) and 
downstream operably linked controlling elements, as well 
as the complementary strands. See, generally, Watson et 
5 al. (1987) The Molecular Bio logy of the Gene. Benjamin, 

Menlo Park, which is hereby incorporated herein by 
reference. A gene geneally also comprises the segment 
encoding the transcription unit, including both introns 
and exons. Thus, an isolated gene allows for screening 

10 for new steroid receptor genes by probing for genetic 

sequences which hybridize to either controlling or 
transcribed segments of a receptor gene of the present 
invention. Three segments of particular interest are the 
controlling elements, both upstream and downstream, and 

15 segments encoding the DNA-binding segments and the 

hormone-binding segments. Methods applicable to such 
screening are analogous to those generally used in 
hybridization or affinity labeling. 

Nucleic acid probes will often be labeled using 

20 radioactive or non-radioactive labels, many of which are 

listed in the section on polypeptide labeling. Standard 
procedures for nucleic acid labeling are described, e.g., 
in Sambrook et al. (1989) ; and Ausubel et al. (1987 and 
supplements) . 

25 

Insect steroid receptor superfamily member polypeptides 
A polypeptide sequence of the ecdysone receptor is 
represented in Table 2. Other insect steroid receptor 
superfamily member polypeptide sequences are set forth in 

30 Tables 1 and 3. Preferred nucleic acid sequences of the 

cDNAs encoding these insect steroid receptor superfamily 
member polypeptides are also provided in the 
corresponding tables. Other nucleic acids will be used 
to encode the proteins, making use of the degeneracy or 

35 non-universality of the genetic code. 

As used herein, the term "substantially pure" 
describes a protein or other material, e.g., nucleic 
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acid, which has been separated from its native 
contaminants. Typically, a monomeric protein is 
substantially pure when at least about 60 to 75% of a 
sample exhibits a single polypeptide backbone. Minor 
variants or chemical modifications typically share the 
same polypeptide sequence. Usually a substantially pure 
protein will comprise over about 85 to 90% of a protein 
sample, and preferably will be over about 99% pure. 
Normally, purity is measured on a polyacrylamide gel, 
with homogeneity determined by staining. Alternatively, 
for certain purposes high resolution will be necessary 
and HPLC or a similar means for purification will be 
used. For most purposes, a simple chromatography column 
or polyacrylamide gel will be used to determine purity. 

The term "substantially free of naturally-associated 
insect cell components" describes a protein or other 
material, e.g., nucleic acid, which is separated from the 
native contaminants which accompany it in its natural 
insect cell state. Thus, a protein which is chemically 
synthesized or synthesized in a cellular system different 
from the insect cell from which it naturally originates 
will be free from its naturally-associated insect cell 
components. The term is used to describe insect steroid 
receptor superf amily members and nucleic acids which have 
been synthesized in mammalian cells or plant cells, E. 
coli and other procaryotes. 

The present invention also provides for analogues of 
the insect steroid receptor superf amily members. Such 
analogues include both modifications to a polypeptide 
backbone, e.g., insertions and deletions, genetic 
variants, and mutants of the polypeptides. Modifications 
include chemical derivatizations of polypeptides, such as 
acetylations, carboxylations and the like. They also 
include glycosylation modifications and processing 
variants of a typical polypeptide. These processing 
steps specifically include enzymatic modifications, such 
as ubiquitinylation. See, e.g., Hershko and Ciechanover 
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(1982), "Mechanisms of Intracellular Protein Breakdown," 
Ann. Rev. Bioch. , 51:335-364. 

Other analogues include genetic variants, both 
natural and induced- Induced mutants are derived from 
5 various techniques, e.g., random mutagenesis using 

reagents such as irradiation or exposure to EMS, or 
engineered changes using site-specific mutagenesis 
techniques or other techniques of modern molecular 
biology. See, e.g., Sambrook et al. (1989) Molgcpular 

10 Cloninat A laborato ry Manual (2d ed.), CSH Press; and 

Ausubel et al. (1987 and supplements) Current Protocols 
in Molecular Biology . Greene/Wiley, New York, each of 
which is hereby incorporated herein by reference. 

As described above, the DNA-binding zinc fingers 

15 segment of a receptor shows high specificity of 

recognition of specific target DMA sequences. An 
understanding of the DNA-protein binding interactions 
provides for the modification in a rational manner of 
either ^)MA or protein characteristics, or both, to effect 

20 specif icity of binding for modulation of enhancer 

activity. More importantly, isolation of genes for new 
members of the insect steroid receptor superfamily allows 
their use to produce the receptor polypeptides and to 
isolate new controlling elements. By using the DNA- 

25 binding domains, as described above, controlling elements 

which are responsive to the ligands bound by the 
corresponding superfamily members are identified and 
isolated. This procedure shall yield a variety of 
controlling elements responsive to ligands. By the 

30 methods described above, the ligands for any particular 

member of the insect steroid receptor superfamily will be 
identified. 

The controlling elements typically are enhancers, 
but also include silencers or various other types of 
35 ligand-responsive elements. They usually operate over 

large distances, but will typically be within about 
50 kb, usually within about 35 kb, more usually within 
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about 20 kb and preferably within about 7 kb of the genes 
that these elements regulate. 

Polypeptide fragments anrt ftifj?^ 

Besides substantially full-length polypeptides, the 
present invention provides for biologically active 
fragments of the polypeptides. Significant biological 
activities include ligand-binding, DNA binding, 
immunological activity and other biological activities 
characteristic of steroid receptor superfamily members. 
Immunological activities include both immunogenic 
function in a target immune system, as well as sharing of 
immunological epitopes for binding, serving as either a 
competitor or substitute antigen for a steroid receptor 
15 epitope. 

For example, ligand-binding or DNA-binding domains 
from different polypeptides will be exchanged to form 
different or new fusion polypeptides or fragments. Thus, 
new chimaeric polypeptides exhibiting new combinations of 
specifibities result from the functional linkage of 
ligand-binding specificities to DNA-binding domains. 
This is extremely useful in the design of inducible 
expression systems. 

For immunological purposes, immunogens will 
sometimes be produced from tandemly repeated polypeptide 
segments, thereby producing highly antigenic proteins. 
Alternatively, such polypeptides will serve as highly 
efficient competitors for specific binding. Production 
of antibodies to insect steroid receptor superfamily 
30 members is described below. 

The present invention also provides for other 
polypeptides comprising fragments of steroid receptor 
superfamily members. Fusion polypeptides between the 
steroid receptor segments and other homologous or 
heterologous proteins are provided, e.g., polypeptide 
comprising contiguous peptide sequences from different 
proteins. Homologous polypeptides will often be fusions 
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between different steroid receptor superfamily members, 
resulting in, for instance, a hybrid protein exhibiting 
ligand specificity of one member and DNA-binding 
specificity of another. Likewise, heterologous fusions, 
5 derived from different polypeptides, will be constructed 

which would exhibit a combination of properties or 
activities of the parental proteins. Typical examples 
are fusions of a reporter polypeptide, e.g., luciferase, 
with another domain of a receptor, e.g., a DNA-binding 

10 domain, so that the presence or location of a desired 

ligand is easily determined. See , e.g., Dull et al., 
U.S. No. 4,859,609, which is hereby incorporated herein 
by reference- Other typical gene fusion partners include 
"zinc finger" segment swapping between DNA-binding 

15 proteins, bacterial fl-galactosidase, trpE Protein A, 

B-lactamase, alpha amylase, alcohol dehydrogenase, and 
yeast alpha mating factor. See , e.g., Godowski et al. 
(1988), Science 241:812-816; and Experimental section 
below. • 

20 

Insect steroid receptor superfamilv member expression 

With the sequence of the receptor polypeptides and 
the recombinant DNA sequences encoding them, large 
quantities of members of the insect steroid receptor 

25 superfamily will be prepared. By the appropriate 

expression of vectors in cells, high efficiency protein 
production will be achieved. Thereafter, standard 
protein purification methods are available, such as 
ammonium sulfate precipitation, column chromatography, 

3 0 electrophoresis, centrifugation, crystallization, and 

others. See, e.g., Deutscher (1990) "Guide to Protein 
Purification" Methods in Enzvmolocrv . vol 182 and others; 
and Ausubel et al. (1987 and supplements) Current 
Protocols in Molecular Biology , for techniques typically 

35 used for protein purification. Alternatively, in some 

embodiments high efficiency of production is unnecessary, 
but the presence of a known inducing protein within a 
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carefully engineered expression system is quite valuable. 
For instance, a combination of: (l) a ligand-responsive 
enhancer of this type operably linked to (2) a desired 
gene sequence with (3) the corresponding insect steroid 
receptor superfamily member will be placed together in an 
expression system provides a specifically inducible 
expression system. The desired gene sequence will encode 
a protein of interest, and the corresponding steroid 
receptor member will often be the ecdysone receptor. 
Typically, the expression system will be a cell, but in 
Yitia expression systems will also be constructed. 

The desired genes will be inserted into any of a 
wide selection of expression vectors. The selection of 
an appropriate vector and cell line depends upon the 
constraints of the desired product. Typical expression 
vectors are described in Sambrook et al. (1989) and 
Ausubel et al. (1987 and supplements) . Suitable cell 
lines are available from a depository, such as the ATCC. 
See, ATCC Catalogue of Cell Lines and Hybridomas (6th 
ed.) (istea); ATCC Cell Lines, Viruses, and Antisera, each 
of which is hereby incorporated herein by reference. The 
vectors are introduced to the desired cells by standard 
transformation or transfection procedures as described, 
for instance, in Sambrook et al. (1989). 

Fusion proteins will typically be made by either 
recombinant nucleic acid methods or by synthetic 
polypeptide methods. Techniques for nucleic acid 
manipulation are described generally, for example, in 
Sambrook et al. (1989), Molecular Cloning: a Laboratory 
Manual (2d ed.), Vols. 1-3, Cold Spring Harbor 
Laboratory, which are incorporated herein by reference. 
Techniques for synthesis of polypeptides are described, 
for example, in Merrifield, J. Amer. Chem. Soc. 85:2149- 
2156 (1963) . 

35 Tne recombinant nucleic acid sequences used to 

produce fusion proteins of the present invention will 
typically be derived from natural or synthetic sequences. 
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Many natural gene sequences are obtainable from various 
cDNA or from genomic libraries using appropriate probes. 
See , GenBank-, National Institutes of Health. Typical 
probes for steroid receptors are selected from the 
5 sequences of Tables 1, 2 or 3 in accordance with standard 

procedures. The phosphoramidite method described by 
Beaucage and Carruthers, Tstra. Letts. 22:1859-1862 
(1981) will produce suitable synthetic DNA fragments. A 
double stranded fragment is then obtainable either by 

10 synthesizing the complementary strand and annealing the 

strand together under appropriate conditions or by adding 
the complementary strand using DNA polymerase with an 
appropriate primer sequence. 

With the isolated steroid receptor genes, segments 

15 of the transcribed segments are available as probes for 

isolating homologous sequences usually from different 
sources, e.g., different animals. By selection of the 
segment used as a probe, particular functionally 
associated segments will be isolated. Thus, for example, 

20 other nucleic acid segments encoding either ligand- 

binding or DNA-binding domains of new receptors will be 
isolated. Alternatively, by using steroid-responsive 
controlling elements as a probe, new steroid-responsive 
elements will be isolated, along with the associated 

25 segment of DNA whose expression is regulated. This method 

allows for the isolation of ligand-responsive genes, many 
of which are, themselves, also members of the insect 
steroid receptor superfamily. 

The natural or synthetic DNA fragments coding for a 

30 desired steroid receptor fragment will be incorporated 

into DNA constructs capable of introduction to and 
expression in an in vitro cell culture. Usually the DNA 
constructs will be suitable for replication in a 
unicellular host, such as yeast or bacteria, but 

35 alternatively are intended for introduction to, with or 

without integration into the genome, cultured mammalian 
or plant or other eucaryotic cell lines. DNA constructs 
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prepared for introduction into bacteria or yeast will 
typically include a replication system recognized by the 
host, the intended DNA fragment encoding the desired 
receptor polypeptide, transcription and translational 
initiation regulatory sequences operably linked to the 
polypeptide encoding segment and transcriptional and 
translational termination regulatory sequences operably 
linked to the polypeptide encoding segment. The 
transcriptional regulatory sequences will typically 
include a heterologous enhancer or promoter which is 
recognized by the host. The selection of an appropriate 
promoter will depend upon the host, but promoters such as 
the trp, lac and phage promoters, tRNA promoters and 
glycolytic enzyme promoters are known. See, Sambrook et 
al. (1989). Conveniently available expression vectors 
which include the replication system and transcriptional 
and translational regulatory sequences together with the 
insertion site for the steroid receptor DNA sequence will 
generally be employed. Examples of workable combinations 
of cell lines and expression vectors are described in 
Sambrook et al. (1989); see also r Metzger et al. (1988), 
Nature 334:31-36. 



Genetic constmni-s 

The DNA segments encoding the members of the insect 
steroid receptor superfamily will typically be utilized 
in a plasmid vector- In one embodiment an expression 
control DNA sequence is operably linked to the insect 
steroid receptor superfamily member coding sequences for 
expression of the insect steroid receptor superfamily 
member alone. In a second embodiment an insect steroid 
receptor superfamily member provides the capability to 
express another protein in response to the presence of an 
insect steroid receptor ligand. This latter embodiment 
is separately described below. The expression control 
sequences will commonly include eukaryotic enhancer or 
promoter systems in vectors capable of transforming or 
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transfecting eucaryotic host cells. Once the vector has 
been introduced into the appropriate host, the host, 
depending on the use, will be maintained under conditions 
suitable for high level expression of the nucleotide 
sequences • 

Steroid-responsive expression of selected genes 

For steroid-responsive expression of other genes, 
the steroid receptor gene will typically be cotrans formed 
with a recombinant construct comprising a desired gene 
for expression operably linked to the steroid-responsive 
enhancer or promoter element. In this use, a single 
expression system will typically comprise a combination 
of (1) a controlling element responsive to a ligand of an 
insect steroid receptor superfamily member, (2) a desired 
gene for expression, operably linked to the controlling 
element, and (3) an insect steroid receptor superfamily 
member which can bind to the controlling element. 
Usually,) this system will be employed within a cell, but 
an in vitro system is also possible. The insect steroid 
receptor superfamily member will typically be provided by 
expression of a nucleic acid encoding it, though it need 
not be expressed at high levels. Thus, in one preferred 
embodiment, the system will be achieved through 
cotransformation of a cell with both the regulatable 
construct and another segment encoding the insect steroid 
receptor superfamily member. Usually, the controlling 
element will be an enhancer element, but some elements 
work to repress expression. In this embodiment, the 
ligand for the insect steroid receptor superfamily member 
will be provided or withheld as appropriate for the 
desired expression properties. 

A particularly useful genetic construct comprises an 
alcohol dehydrogenase promoter operably linked to an 
easily assayable reporter gene, e.g., 6-galactosidase. 
In a preferred embodiment of this construct, a 
multiplicity of copies of the insect steroid receptor 
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superfamily member is used. For example, operable 
linkage of controlling elements responsive to insect 
steroid receptor superfamily members, e,g., EcR, DHR3, 
E75A and E75B, to the alcohol dehydrogenase (ADH) 
5 promoter, or others as described above, and protein 

coding sequences for a particular reporter protein, as 
described above, leads to steroid-responsive expression 
of fl-galactosidase. Such a system provides highly 
sensitive detection of expression in response to ligand 
10 binding, allowing for detection of a productive ligand- 

receptor interaction. 

DNA sequences will normally be expressed in hosts 
after the sequences have been operably linked to (i.e., 
positioned to ensure the functioning of) an expression 
control sequence. These expression vectors are typically 
replicable in the host organisms either as episomes or as 
an integral part of the host chromosomal DNA. Commonly, 
expression vectors will contain selection markers, e.g., 
tetracycline or neomycin, to permit detection of those 
cells transformed with the desired DNA sequences ( see , 
e.g., U.S. Patent 4,704,362, which is incorporated herein 
by reference) . 

£• SQll is one procaryotic host useful for cloning 
the DNA sequences of the present invention. Other 
25 microbial hosts suitable for use include bacilli, such as 

P*cj,Uttg pubtilfts , and other enterobacteriaceae, such as 
Salmonella, Seo^tia, and various Fseudomonas species. 

Other eucaryotic cells will often be used, including 
yeast cells, insect tissue culture cells, avian cells, or 
the like. Preferably, mammalian tissue cell culture will 
be used to produce the inducible polypeptides of the 
present invention (see, Winnacker, From Genes to clones . 
VCH Publishers, N. Y. (1987), which is incorporated herein 
by reference) . Mammalian cells are preferred cells in 
which to use the insect steroid receptor superfamily 
member ligand-responsive gene constructs, because they 
naturally lack the molecules which confer responses to 
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the ligands for insect steroid receptor superfamily 
members . 

Mammalian cells are preferred because they are 
insensitive to many ligands for insect steroid receptor 
5 superfamily member. Thus, exposure of these cells to the 

ligands of the insect steroid receptor superfamily 
members typically will have negligible physiological or 
other effects on the cells, or on a whole organism. 
Therefore, cells can grow and express the desired 

10 product, substantially unaffected by the presence of the 

ligand itself. The ligand will function to cause 
response either in the positive or negative direction. 
For example, it is often desirable to grow cells to high 
density before expression. In a positive induction 

15 system, the inducing ligand would be added upon reaching 

high cell density, but since the ligand itself is benign 
to the cells, the only physiological imbalances result 
from the expression, e.g., the product, itself. 
Alternatively, in a negative repression system, the 

20 ligand is supplied until the cells reach a high density. 

Upon reaching a high density, the ligand would be 
removed. Introduction of these cells into a whole 
organism, e.g., a plant or animal, will provide the 
products of expression to that organism. In this 

25 circumstance, the natural insensitivity of cells to the 

ligands will also be advantageous. 

Expression vectors for these cells can include 
expression control sequences, such as an origin of 
replication, a promoter, an enhancer and necessary 

30 processing information sites, such as ribosome-binding 

sites, RNA splice sites, polyadenylation sites, and 
transcriptional terminator sequences. Preferably, the 
enhancers or promoters will be those naturally associated 
with genes encoding the steroid receptors, although it 

35 will be understood that in many cases others will be 

equally or more appropriate. Other preferred expression 
control sequences are enhancers or promoters derived from 
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viruses, such as SV40, Adenovirus, Bovine Papilloma 
Virus, and the like. 

Similarly, preferred promoters are those found 
naturally in iamunoglobulin-producing cells (£ee u S 
Patent No. 4,663,281, which is incorporated herein by' 
reference,, but SV40, polyoma virus, cytomegalovirus 
(human or murine) and the LTR from various retroviruses 
(such as murine leukemia virus, murine or Rous sarcoma 
virus and HIV) are also available. s^, finh^s^nd 
fiukaryotic nwtf Exprpssion , cold Spring Harbor Press, 
N.Y., 1983, which is incorporated herein by reference 

The vectors containing the DNA segments of interest 
(e.g., the steroid receptor gene, the recombinant 
steroid-responsive gene, or both) can be transferred into 
the host cell by well-known methods, which vary depending 
on the type of cellular host. For example, calcium 
chloride transfection is commonly utilized for 
procaryotic cells, whereas calcium phosphate treatment is 
often used for other cellular hosts. See, generally, 
Sambrook* et al. (1989), MPlecular clonic » t^-^. 
Hamial (2d ed.), cold spring Harbor Press, Ausubel et al 
(1987 and supplements) Cement Pr-n^ is in mo!^,^. 
liologv., Greene/Wiley, New York; and Potrykus (1990) 
"Gene Transfer to. Cereals: An Assessment," 
BWTechnoTocfy 8:535-542, each of which is incorporated 
herein by reference, other transformation techniques 
mclude electroporation, DEAE-dextran, aicroprojectile 
bombardment, lipofection, microinjection, and others. 
The term "transformed cell" is nea nt to also include the 
progeny of a transformed cell. 

As with the purified polypeptides, the nucleic acid 
segments associated with the ligand-binding segment and 
the DNA-binding segment are particularly useful. These 
gene segments will be used as probes for screening for 
new genes exhibiting similar biological activities 
though the controlling elements of these genes are'of 
equal importance, as described below. 
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Many types of proteins are preferentially produced 
in eucaryotic cell types because of abnormal processing 
or modification in other cell types. Thus, mammalian 
proteins are preferably expressed in mammalian cell 
cultures. Efficient expression of a desired protein is 
often achieved, as described above, by placing: (1) a 
desired protein encoding DNA sequence adjacent to 
controlling elements responsive to ligands for insect 
steroid receptor superfamily members and an appropriate 
promoter. Cyclic pulses of ligands in a cell culture may 
provide periods for cells to recover from effects of 
production of large amounts of exogenous protein. Upon 
recovery, the ligand will often be reinduced. 

Additional steroid responsive gene elements have 
15 also been isolated, e.g., substantially purified, using 

the techniques of the present invention. Other genes 
adjacent to, and operably linked to, steroid responsive 
gene control elements are selectable by locating DNA 
segments j to which steroid receptors specifically bind or 
20 by hybridization to homologous controlling elements. For 

example, other steroid responsive genes have been 
isolated. Many of the genes which are ligand-responsive 
will also be new members of the insect steroid receptor 
superfamily. 

25 Having provided for the substantially pure 

polypeptides, biologically active fragments thereof and 
recombinant nucleic acids comprising genes for them, the 
present invention also provides cells comprising each of 
them. By appropriate introduction techniques well known 

30 in the field, cells comprising them will be produced. 

See , e.g., Sambrook et al. (1989). 

In particular, cells comprising the steroid 
responsive controlling elements are provided, and 
operable linkage of standard protein encoding segments to 

35 said controlling elements produce steroid responsive 

systems for gene expression. Cells so produced will 
often be part of, or be introduced into, intact 
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organisms, for example, plants, insects (including 
caterpillars and larvae), and higher animals, e.g., 
mammals. This provides for regulatable expression of 
desired genes where the regulating ligand has no other 
effects on the cells because the cells otherwise lack the 
receptors and responsive genes. For example, plants will 
be induced to fruit at desired times by administration of 
the appropriate ligand, or animals will be ligand- 
responsive in production of particular products. And, in 
fact, biochemical deficiencies may be overcome by ligand- 
responsive expression of cells introduced into an intact 
organism which, itself, also otherwise lacks genes 
responsive to the presence of such a ligand. Multiple 
repeats of the control elements will lead, often, to at 
least additive or synergistic control. Cells containing 
these expression systems will be used in gene therapy 
procedures, including in humans. 

Once a sufficient quantity of the desired steroid 
receptor polypeptide has been obtained, the protein is 
useful for many purposes. A typical use is the 
production of antibodies specific for binding to steroid 
receptors. These antibodies, either polyclonal or 
monoclonal, will be produced by available in vitro or in 
vivo techniques. 

For production of polyclonal antibodies, an 
appropriate target immune system is selected, typically a 
mouse or rabbit. The substantially purified antigen is 
presented to the immune system in a fashion determined by 
methods appropriate for the animal and other parameters 
well known to immunologists. Typical sites for injection 
are in the footpads, intramuscularly, intraperitoneally, 
or intradermally. Of course, another species will often 
be substituted for a mouse or rabbit. 

An immunological response is usually assayed with an 
immunoassay. Normally such immunoassays involve some 
purification of a source of antigen, for example, 
produced by the same cells and in the same fashion as the 
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antigen was produced. The immunoassay will generally be 
a radioimmunoassay, an enzyme-linked assay (ELISA) , a 
fluorescent assay, or any of many other choices, most of 
which are functionally equivalent but each will exhibit 
advantages under specific conditions. 

* Monoclonal antibodies with affinities of 10 M~ 
preferably 10 9 to 10 10 , or stronger will typically be 
made by standard procedures as described, e.g., in Harlow 
and Lane (1988) , firi+A bodies: A Laboratory Manual, Cold 
Spring Harbor Laboratory; or Goding (1986), Mopoplgn^l 
Antibodies; Principles and P ractice (2d ed) Academic 
Press, New York, which are hereby incorporated herein by 
reference. Briefly, appropriate animals will be selected 
and the desired immunization protocol followed. After 
the appropriate period of time, the spleens of such 
animals are excised and individual spleen cells fused, 
typically, to immortalized myeloma cells under 
appropriate selection conditions. Thereafter the cells 
are clonally separated and the supernatants of each clone 
are tested for their production of an appropriate 
antibody specific for the desired region of the antigen. 

Other suitable techniques involve In Yjtro exposure 
of lymphocytes to the antigenic polypeptides or 
alternatively to selection of libraries of antibodies in 
phage or similar vectors. See, Huse et al., (1989) 
"Generation of a Large Combinatorial Library of the 
Immunoglobulin Repertoire in Phage Lambda," Science 
246:1275-1281, hereby incorporated herein by reference. 

The polypeptides and antibodies of the present 
invention will be used with or without modification. 
Frequently, the polypeptides and antibodies will be 
labeled by joining, either covalently or non-covalently, 
a substance which provides for a detectable signal. A 
wide variety of labels and conjugation techniques are 
known and are reported extensively in both the scientific 
and patent literature. Suitable labels include 
radionuclides, enzymes, substrates, cof actors, 
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inhibitors, fluorescein, chemiluminescers, magnetic 
particles and the like. Patents, teaching the use of 
such labels include U.S. Patent Nos. 3,817,837; 
3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; 
and 4,366,241. Also, recombinant, chimeric, or 
humanized, immunoglobulins will be produced, see, e.g., 
Cabilly, U.S. Patent No. 4,816,567; Jones et al., 1986, 
Nature 321, 522-526; and published UK patent application 
No. 8707252; each of which is hereby incorporated herein 



Another use of purified receptor polypeptides is for 
determination of the structural and biosynthetic aspects 
of the polypeptides. Structural studies of interactions 
of the ligand-binding domains with selected ligands are 
performed by various methods. The preferred method for 
structural determination is X-ray crystallography but may 
include various other forms of spectroscopy or 
chromatography. See, e.g.., Connolly, M.L., J. Aopl. 
Crystal!^ , 16:548 (1983); Connolly, M.L. , Science 221:709 
(1983) ; and Blundell and Johnson (1976) Protein 
CrystallQcryaphy t Academic Press, New York; each of which 
is hereby incorporated herein by reference. For example, 
the structure of the interaction between hormone ligand 
and hormone-binding segments is determined to high 
resolution. From this information, minor substitutions 
or modifications to either or both of the ligand and 
ligand-binding segment are made based upon, e.g., the 
contact regions between the two. This information 
enables the generation of modified interactions between a 
ligand and its binding segment to either increase or 
decrease affinity of binding and perhaps increase or 
decrease response to binding. Likewise, the interaction 
between the zinc finger DNA-binding segments with the 
specific nucleic acid-binding sequence will be similarly 



As a separate and additional approach, isolated 
ligand-binding polypeptide domains will be utilized to 
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screen for new ligands. Binding assays will be 
developed, analogous, e.g., to immunoassays . This 
procedure permits screening for new agonists or 
antagonists of a particular steroid receptor. Isolated 
DNA-binding segments will be used to screen for new DNA 
sequences which will specifically bind to a particular 
receptor-binding segment. Typically, these receptor- 
specific binding sites will be controlling elements for 
steroid responsive genes. Thus, having isolated these 
DNA-binding sequences, genes which are responsive to the 
binding of a given receptor can be isolated. This 
provides a method for isolating genes which are 
responsive to induction or inhibition by a given hormone 
receptor. 

In another aspect of the present invention, means 
for disrupting insect development are provided where new 
ligand agonists or antagonists are discovered. These 
compounds are prime candidates as agonists or antagonists 
to interfere with normal insect development. By 
application of new steroid analogues of ligands for 
insect steroid receptor superfamily members, it is 
possible to modify the normal temporal sequence of 
developmental events. For example, accelerating insect 
development will minimize generation time. This will be 
very important in circumstances where large numbers of 
insects are desired finally, for instance, in producing 
sterile males in Mediterranean fly infestations. 
Alternatively, it is useful to slow development in some 
pest infestations, such that the insects reach 
destructive stages of development only after commercial 
crops have passed sensitive stages. 

In another commercial application, ligands 
discovered by methods provided by the present invention 
will be used in the silk-production industry. Here, the 
silkworms are artificially maintained in a silk-producing 
larvae stage, thereby being silk productive for extended 
time periods. The development of larvae will also be 
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susceptible to acceleration to silk-production in a 
shorter time period than than naturally. 

Other analogues of ligands for insect steroid 
receptor super family members will be selected which, upon 
application, will completely disrupt normal development 
and, preferably, lead to a lethal result. Slightly 
modified natural substances will often have greater 
specificity of action and much higher activities, 
allowing for lower levels of application. For example, 
more lipophilic ligands are more readily absorbed 
directly into the insect surface or cuticle. Thus, 
extremely low concentrations of natural ligands should be 
effective in controlling pests. Furthermore, many of 
these ligands are likely to be relatively easily 
manufactured, taking advantage of enzymatic production 
methods. New ligands for insect steroid receptor 
superf amily members will sometimes be more species 
specific or will exhibit particularly useful 
characteristics, for example, being lethal to specific 
harmful insects. The greater specificity of the hormones 
will allow avoidance of the use of non-specific 
pesticides possessing undesired deleterious ecological 
side effects, e.g., pesticide residue accumulation in 
food, often having deleterious effects on humans. 
Furthermore, compounds having structures closely 
analogous to natural compounds should be susceptible to 
natural mechanisms of biological degradation. 

Another aspect of the present invention provides for 
the isolation or design of new gene segments which are 
responsive to ligands for insect steroid receptor 
superf amily members. For example, use of the nucleic 
acids to screen for homologous sequences by standard 
techniques will provide genes having similar structural 
features. Similarly arranged intron structures will 
typically be characteristic of larger superfamily 
categories. The preferred domains for screening will be 
the ligand-binding or DNA-binding segments, although the 
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DNA segments which are recognized by the DNA-binding 
domains, i.e., the controlling elements, will also be of 
particular interest. Screening for new controlling 
elements will usually take advantage of known 
5 similarities, e.g., sequence homology to other known 

elements, or homology to the DNA zinc finger-binding 
domains of other receptors. Receptors and genes 
important in the general developmental sequence of 
expression will be discovered. Using this set of 
10 developmentally regulated genes will allow selection of 

particular molecules which are responsible for 
controlling expression of developmentally regulated 
genes . 

Kits for the determination of expression levels of 
15 the nucleic acids and proteins provided herein are made 

available. Typically, the kit will have at least one 
compartment which contains a reagent which specifically 
binds to the desired target molecule, e.g., ligand 
analogues, receptors, or nucleic acids. These reagents 
20 will be f used in techniques for assays, e.g., using 

methods typically used in screening protocols. See, 
e.g., Sambrook et al. (1989) and Ausubel et al. (1987 and 
supplements) . 

The following experimental section is offered by way 
25 of example and not by limitation. 

ftXPERTMENTAL 
EXAMPLE I 

CLONING STRUCTURE AND EXPRESSION OF THE pROSOPHILA 
30 £75 GENE THAT ENCODES TWO MEMBERS OF THE STEROID 

RECEPTOR SUPERFAMILY. 

The following experiments demonstrate that the E2S 
gene encodes two members of the steroid receptor 
35 superfamily. The proteins it encodes share amino acid 

sequence homology with the conserved DNA-binding and 
ligand-binding domains of this superfamily. The EZS gene 
is ecdysone-inducible, and it occupies and causes the 
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ecdysone-inducible early puff at the 75B locus in the 
Prosophila polytene chromosome, 

A, Cloning of Genomic DNA Encompassing the 
Ecdvsone- Inducihlft 75B Puff T^ ig 

We have used the method of chromosomal walking 
(Bender, W. , P. Spierer, and D. S. Hogness, 1983. 
Chromosomal walking and jumping to isolate DNA from the 
Ace and rosy loci and the Bithorax complex in Drosonhn* 
melanoq^ster . J T Mpl. 9^1, 168:17-33) to isolate the 
genomic DNA encompassing the 75B puff region. The 
starting point for the walk was a genomic clone, 
designated A8253, which had been localized by in situ 
hybridization to the proximal end of 75B. Isolated 
restriction fragments of A8253 were used to screen a 
library of genomic DNA from the Canton S (c 5 ) strain of 
fl. melanoqaster . See (Maniatis, T. , R. c. Hardison, E. 
Lacy, J. Lauer, c. O'Connell, D. Quon, G. K. Sim, and 
A. Efstradiatis, 1978, "The isolation of structural genes 
from libraries of eucaryotic DNA." Cell 15:687-701). 
Genomic clones AcDm3504 and AcDm3505 were isolated by 
homology to A8253. 

The walk was then extended in both directions until 
-100 kb of genomic DNA had been isolated, and the 
25 orientation of the walk was determined by in situ 

hybridization of the terminal segments to polytene 
chromosomes. Thereafter, the walk was extended in the 
rightward direction on the molecular map, or distally 
relative to the centromere. The 350 kb of genomic DNA 
30 encompassed by the walk corresponds to the chromosomal 

region between bands 75A6-7 and 75B11-13, as determined 
by in situ hybridization. This region includes the 75B 
puff, which appears to initiate by simultaneous 
decondensation of chromosomal bands 75B3-5 and then 
35 spreads to surrounding bands. 

Methods 

Genomi c DNA librari ? g 
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Canton S genomic DNAs were isolated from a library 
of sheared, EcoRI-linkered Canton S DNA cloned into the 
Charon 4 A phage vector. See (Maniatis, T., 
R. C. Hardison, E. Lacy, J. Lauer, C. O'Connell, D. Quon, 
5 G. K. Sim, and A. Efstradiatis, 1978. The isolation of 

structural genes from libraries of eucaryotic DNA. Cell 
15:687-701). 0 r genomic DNAs were isolated from a library 
of sheared DNA, GC-tailed into the sep6 A vector. See 
(Meyerowitz, F. M. , and D. S. Hogness, 1982. "Molecular 

10 organization of a nrosophila puff site that responds to 

ecdysone." Cell 28:165-176). One step in the 
chromosomal walk was taken using a cosmid library of 
SauIIIA partially digested O r DNA cloned into the cosmid 
pl4Bl by the method of Ish-Horowicz and Burke (Ish- 

15 Horowicz> D. , and J. F. Burke, 1982. Rapid and efficient 

cosmid cloning. wii«i«ie Aci ds Res. 9:2989-2998). 

m situ hy bridization 

In .situ hybridization to polytene chromosomes was 
carried out with DNA probes that were nick-translated in 
the presence of ^-labeled TTP (NEN) , as described by 
Bonner and Pardue (Bonner, J. J., and M. L. Pardue, 1976. 
Ecdysone-stimulated RNA synthesis in imaginal discs of 
prosoohila mslanoaaster . Assay by in situ hybridization. 
chromosoma 58:87-99), with the following modifications: 
Heat and RNAase treatments of the slides were omitted, 
and hybridization and washing were at 63 'C in 2XSSPE for 
18 and 2 hours, respectively. 

B. Identification of a 50 kb Region of Cloned 

Genomic DNA that Contains Sequences Homologous 
to Ecdysone-induc ^ transcripts . 

Restriction fragments of the above genomic clones 
were tested for their ability to hybridize with each of 
two cDNA probes, one derived from the RNA in ecdysone- 
induced cells, and the other from the RNA in noninduced 
cells. Two differential screens were carried out. In 
the first, genomic DNA covering the entire 350 kb walk 
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was examined with cDNA probes synthesized with reverse 
transcriptase from an oligo(dT) primer annealed to 
poly (A) + RNA. The poly (A) + RNA was prepared from total 
inner tissues that were mass-isolated from late third 
instar larvae and incubated in the presence of ecdysone 
plus cycloheximide, or cycloheximide alone. (See 
Methods, below. Cycloheximide was included because 
higher levels of ecdysone-induced transcripts accumulate 
in its presence.) 

Each of the 32 P-labeled cDNA probes made from these 
two poly (A) + RNAs was applied to one of two duplicate 
Southern blots that contained, in addition to the genomic 
fragments from the walk, a control DNA consisting of 
sequences from the ribosomal protein 49 gene (O'Connell, 
P., and M. Rosbash, 1984. Sequence, structure and codon 
preference of the prospphU^ ribosomal protein 42 gene. 
Fugleic Acjds 3es f 12:5495-5513), which was used to 
normalize the hybridization intensities of the duplicate 
blots. This screen revealed sequences specific to 
ecdysone-induced RNAs only within the AcDm3522 genomic 
clone that is centered at approximately +220 kb on the 
molecular map. 

Because the above probes will preferentially detect 
sequences near the 3' termini of the RNAs, particularly 
in the case of long transcripts, a second differential 
screen was carried out with cDNA probes primed with 
random hexamers (see Methods, below) . This screen, which 
was restricted to the 135 kb of genomic DNA between +105 
kb and +240 kb, revealed ecdysone-inducible sequences in 
fragments spread out over an -50 kb region between +170 
kb and +220 kb. This region represents the £75 gene. 

Methods 

Organ cul ture and RNA isolating 

Late third instar 0 r larvae were harvested, washed in 
0.7% NaCl, resuspended in Robb's phosphate-buffered 
saline (PBS) (Robb, J. A., 1968. Maintenance of imaginal 
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discs of nrosophila m»mnogaster in chemically defined 
media, -t r«n ■ Biol. 41:876-885) , preaerated with a 
blender, and passed through a set of rollers to extrude 
the organs. This "grindate" was filtered through a 
coarse Nitex screen to remove carcasses, and settled five 
times (3-5 minutes per settling) by gravity to remove 
floating and microscopic debris. Isolated tissues 
(primarily salivary glands, imaginal discs, gut, and 
Malphigian tubules) were cultured at 25 'C in plastic 
petri dishes in aerated Robb's PBS. fl-ecdysone (Sigma) 
(0.2 /il/ml of 10 mg/ml) in ethanol and/or cycloheximide 
(2 Ml/ml of 35 mM) in water was added to the appropriate 
cultures. Incubations in the presence of cycloheximide 
were for -8 hours. Isolated tissues were homogenized in 
10 volumes of 6 M guanidine-HCl/0. 6 M sodium acetate 
(pH 5.2), centrifuged at 5000 g for 10 minutes to remove 
debris, and layered onto a 5.7 M CaCl shelf, as described 
previously (Chirgwin, J. M. , A. E. Przbyla, 
R. J. MacDonald, and W. J. Rutter, 1979. Isolation of 
biologically active ribonucleic acid from sources 
enriched in ribonuclease. Biochemistry 18:5294-5299). 
Poly(A)+ RNA was purified by oligo(dT) chromatography. 

Southern blot a nalysis 

Southern blots were performed on nitrocellulose, as 
described previously (Segraves, W. A., C. Louis, S. 
Tsubota, P. schedl, J. M. Rawls, and B. P. Jarry, 1984. 
The rudimentary locus of prosophjla melanoqaster . 
Mol. Biol. 175:1-17) . cDNA probes were prepared by 
reverse transcription (AMV reverse transcriptase; 
Seikagaku) of 2 ng of poly (A) + RNA with 700 ng of 
oligo(dT) m6 (Collaborative Research) or 15 fig of random 
hexamers (Pharmacia) in a 20 fil reaction mixture 
containing 80 mM Tris CI (pH 8.3 at 42 -C), 10 mM MgCl 2 , 
100 mM KC1, 0.4 mM DTT, 0.25 mM each of dATP, dGTP, and 
dTTP, and 100 nCi of [ 32 P]dCTP (800 Ci/mole; Amersham) . 
After incubation at 37 'C for 45 minutes, 80 M l of 10 mM 
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EDTA and 2 Ml of 5 N NaOH were added before incubation at 
70 # C for 10 minutes to denature the products and 
hydrolyze the RNA. After the addition of 10 pi of 1 M 
Tris-Cl (pH 7.5) and 5 Ml of l N HC1, unincorporated 
label was removed by chromatography on Biogel P60. 

C. The E75 Gene Contains Two Overlapping 
Transcription Unit s; E75A and E75B 

Northern blot analysis of ecdysone-induced and 
noninduced RNAs, prepared as described above and 
hybridized with strand-specific DNA probes derived from 
cloned restriction fragments in the 60 kb region (+166 to 
+226 kb) containing the E75 gene, demonstrated that this 
gene produces two classes of ecdysone-inducible mRNAs, 
both derived from rightward transcription. The E75A 
class of mRNAs hybridized with probes from both the 5' 
(left) and 3' (right) ends of the 50 kb E75 gene. The 
E75B class hybridized only with probes from the 3'- 
proximal 20 kb of the gene. These results suggest that 
the A arid B classes of ecdysone-inducible RNAs are 
initiated by different promoters, located about 30 kb 
apart, and that the two transcription units defined by 
these promoters overlap in the region downstream from the 
B promoter. 

This suggestion was confirmed by analysis of the 
structure of cloned cDNAs from the E75A and E75B mRNAs. 
Approximately 10 6 clones from an early pupal cDNA library 
(Poole, S. J., L. M. Kauvar, B. Drees, and T. Romberg, 
1985. The engrailed locus of Drosophila * Structural 
analysis of an embryonic transcript. Cell 40:37-40) were 
screened at low resolution with genomic DNA probes from 
the £75 gene region. The 116 cDNA clones identified by 
this screen were analyzed by restriction digestion and 
hybridization to a panel of probes derived from the 60 kb 
(+166 to +226 kb) region. One of the clones, ADm4925, 
was thereby selected as a representative of the E75A 
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class of mRNAs, and another, ADm4745, as a representative 
of the E75B mRNA class. 

The genomic regions homologous to these two cDNA 
clones were further localized by Southern blot analysis, 
and the nucleotide sequence of these regions and of both 
cDNA clones was determined. These sequences are given in 
Table l, along with those derived from 5' and 3' terminal 
sequence determinations for each transcription unit. 
These data demonstrate that the 50 kb EISA transcription 
unit consists of six exons, labeled in 5' to 3' order: 
AO, Al, 2, 3, 4 and 5, of which exons AO and Al are 
specific to this unit, while the remaining four are 
shared with the 20 kb E75B transcription unit. 
Similarly, the E75B unit contains a specific exon, 
labeled Bl, at its 5' end, which is located just upstream 
of the shared exon 2. Thus, the EZ5 gene consists of two 
transcription units, of which the shorter E75B unit 
occupies the 3' proximal 20 kb of the longer E75A unit. 
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Table l. Sequences of the E75 exons and 
flanking DNA. The sequence is that of the C s genomic 
DNA, which was identical to that of the cDNAs , except for 
the T-G change indicated at position +2691. This change 
would convert a leucine to an arginine in the protein 
sequences. The Dm4925 cDNA extends from just 5' of the 
EcoRV site at +939 to +4267 in A. The Dm4745 cDNA 
extends from +804 in B to a point near the Hindlll site 
at +4246 in A. (A) The E75 A exons and flanking DNA. The 
sequences of the AO, Al, and common exons 2-5 are 
interrupted by intron sequences (lowercase) , which are 
limited to those near the splice sites and are in 
agreement with consensus sequences for donor (5') and 
15 acceptor (3«) splice sites. Negative numbers at the 

right end of each line refer to the number of base pairs 
upstream of the E75 A initiation site, positive numbers 
refer to positions in the E75 A mRNAs, continuing into 
the 3' flanking DNA. Numbers at the left end of each 
20 line refer to amino acid residues in the E75 A protein. 

The underlined 14 bp sequence at -159 to -172 exhibits a 
13/14 bp match to a sequence (CGTAGCGGGTCTC) found 47 bp 
upstream of the ecdysone-inducible E74 A transcription 
unit responsible for the early puff at 74EF. This 
25 sequence represents the proximal part of a 19 bp sequence 

m the E74 A promoter that binds the protein encoded by 
the £. melanoggpter 2este gene. Another underlined 
sequence- in the E75 A promoter at -74 to -82 is also 
found in) the E75 B promoter, where it is part of a 
30 tandemly repeated octanucleotide (GAGAGAGC) located at 

-106 to -121 in B. This repeat matches the consensus 
sequence for the binding sites of the GAGA transcription 
factor which also binds to the E74 A promoter. Other 
underlined sequences represent, at -27 to -33, the best 
35 match to the TATA box consensus at an appropriate 

position, three AUG codons that are closely followed by 
in- frame stop codons in the 5 1 -leader sequence of the E75 
mRNAs, and alternative polyadenylation-cleavage signals 
at 4591 and 5365 that are used by both E75 A and E75B 
40 mRNAs. (B) The Bl exon and its 5» -flanking DNA. The 

numbering at the right and left ends of the lines follows 
the same convention as in A. Exons 2-5 shown in A are 
also used in E75 B, but the amino acid residues and base 
pair numbers shown in A must be increased by 157 and 375, 
45 respectively, to apply to the E75 B protein and mRNA. 

The first ten nucleotides of the 136-nucleotide E75 
B-intron linking the Bl exon to Exon 2 are gtaggttag, 
whereas the last ten are shown upstream of nucleotide 
1178 in A. The underlined sequences represent, in order, 
50 the region of homology to a sequence upstream of E75 A, 

noted above, the best match to the TATA box consensus 
at "21 to -27, and three AUG codons followed by in-frame 
stop codons in the 5 1 leader of the E75 B mRNA. 

Panels 1 and 2 are shown in detail in panels 3-8, 
55 and 9-10, respectively. 




WO 91/13167 



PCI7US91/01189 



49 



10 



15 



20 



25 



30 



Methods 

cdna libraries 

The ADm4925 and ADm4745 cDNAs were isolated from an 
O r early pupal cDNA library in AgtlO (Poole, S. J., 
L. K. Kauvar, B. Drees, and T. Kornberg, 1985. The 
engrailed locus of Drosophila : structural analysis of an 
embryonic transcript. Cell 40:37-40). The two cDNAs 
(ADm4927 and ADm4928) that were used for 3' -end mapping 
were isolated from an ecdysone-induced salivary gland 
cDNA library in A607 prepared by C. W. Jones. (Our 
strain collection names for the cDNA clones used in these 
studies are AfDm4925, AfDm4745, AeDm4927, and AeDm4928.) 

Northern blot analysis 

Probes to be used for Northern blots were cloned 
into the vector p#X (from R. Mulligan) , containing the 
0X174 origin of replication cloned in between the Hindlll 
and BamHI sites of pBR322. This allowed the synthesis of 
single-itranded probe DNA (Arai, K. , N« Arai, J. 
Schlomai, and A. Kornberg, 1980. Replication of duplex 
DNA of phage 0X174 reconstituted with purified enzymes. 
Proc. Natl. Acad. Sci. 77:3322-3326), which was performed 
by the incubation of .supercoiled plasmid DNA with gene A 
protein , rep and ssb proteins, and DNA polymerase III 
holoenzyme in a reaction containing 20 mM Tris CI 
(pH 7.5), 80 /xg/ml BSA, 4% glycerol, 20 mM DTT, 1 mM ATP, 
16 mM concentrations of the three unlabeled 
deoxynucleotides and 1.6 mM concentrations of the labeled 
deoxynucleotide for 1 hour at 30 B C. EDTA was then added 
to 20 mM, SDS to 0.1%, and proteinase K to 50 nq/ml. The 
reactions were digested for 30 minutes at 37 # C, and 
unincorporated label was removed by gel filtration. 

Si nuclease protection and primer extension analysis 

Single-stranded probes, prepared as described above 
by the *X in vitro replication system, were purified by 
electrophoresis on low melting point agarose gels for use 
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as SI probes. All other probes were prepared by 
extension of the -20, 17-mer sequencing primer (New 
England Biolabs) on single-stranded M13mp (Messing, J., 
1983. New M13 vectors for cloning. Methods Knzvmol . 
101:20-78) or pEMBL (Dente, L. , G. Cesareni, and R. 
Cortes, 1983. pEMBL: A new family of single-stranded 
plasmids. Nucleic Acids Res, 11:1645-1654) recombinant 
templates using 32 P-labeled nucleotides, followed by 
cleavage with the appropriate restriction enzyme and 
purification of the probe on denaturing polyacrylamide 
gels. Labeled probe (100,000-300,000 cpm) was incubated 
with 1 fxq of poly (A) + RNA in a 5 /xl reaction mixture 
containing 5 /xg of yeast tRNA, 0.4 M NaCl, 40 mM PIPES 
(pH 6.8), and 1 mM EDTA at 60 'C under oil. Reactions 
were cooled and diluted 1:10 into either SI digestion or 
primer extension buffer. SI nuclease digestions were 
performed in 50 mM acetate buffer (Na) , 400 mM NaCl, and 
4 mM 2nS0 4 at 20 *C for 1 hour with -15 - 150 Vogt units of 
SI nuclease (Boehringer) per 50 fil reaction. Primer 
extensions were performed at 42 *C in 50 mM Tris CI 
(pH 8.3 at 42 # C), 80 mM KC1, 2 mM DTT, 1 mM of dATP, 
dCTP, dGTP, and dTTP, with 20 units of AMV reverse 
transcriptase (Seikagaku) per 50 Ml reaction. Reactions 
were terminated by the addition of EDTA, tRNA carrier was 
added to the SI nuclease digestions, and samples were 
ethanol-precipitated and either electrophoresed directly 
on 5% or 6% denaturing polyacrylamide gels or glyoxalated 
(McMaster, G. K. , and G. C. Carmichael, 1977. Analysis 
of single and double-stranded nucleic acids on 
polyacrylamide and agarose gels by using glyoxal and 
acridine orange. Proc. Natl . Acad. sd . 74:4835-4838) 
and electrophoresed on 1% agarose gels run in 10 mM 
sodium phosphate buffer (pH 6.8), 

DNA sequence analysis 

The cDNA clones ADm4927 and ADm4928 were sequenced 
by chemical degradation (Maxam, A. M., and W. Gilbert, 
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1980. Sequencing end-labeled DNA with base-specific 
chemical cleavage. Methods Enzvmol. 65:499-560). All 
other sequencing was performed using the 
dideoxynucleotide chain termination method (Sanger, F., 
5 A. R. Coulson, B. F. Barrell, A. J. H. Smith, and 

B. A, Roe, 1980. Cloning in single^stranded 
bacteriophage as an aid to rapid DNA sequencing. J. Mol. 
Biol. 143:161-178). Fragments were cloned into M13mp 
vectors (Messing, J., 1983. New M13 vectors for cloning. 

10 Methods Enzvmol. 101:20-78) or pEMBL (Dente, I*. , 

G. Cesareni, and R. Cortes, 1983. pEMBL: A new family of 
single-stranded plasmids. Nucleic Acids Res. 11:1645- 
1654) and sequenced directly or following the generation 
of a set of overlapping deletions using exonuclease III 

15 (Henikoff, S., 1984. Unidirectional digestion with 

exonuclease III creates targeted breakpoints for DNA 
sequencing. Gene 28:351-359). Sequencing was performed 
on both strands of the ADm4925 cDNA, the B-specific 
region c}f ADm4745 cDNA, the A- and B-specific 5 1 genomic 

20 regions not represented in the cDNAs, and the 3' -flanking 

region. The remaining exon boundaries of ADm4745 and 
genomic regions represented within the cDNA clones were 
sequenced on one strand. 

25 D. The E75 Gene Encodes Two Members of the Steroid 

Receptor Superfamilv 

The coding and noncoding sequences of the E75 A and 
B mRNAs, their splice junctions, and the 5' and 3' 

30 flanking sequences are shown in Table 1. Certain 

sequences of potential interest within the 5' flanking 
DNA and in the 5' leader mRNA sequences are indicated in 
the legend to Table 1. We focus here on the large open 
reading frames of the E75 A and B mRNAs that begin at 

35 380 bp and 284 bp downstream from their respective mRNA 

start sites, each continuing into the common final exon. 
The termination codon in exon 5 lies upstream of both 
alternative polyadenylation sites; thus, the sequence of 
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the encoded protein is not affected by which site is 
selected. Since the open reading frames in the E75 A and 
B mRNAs begin in the AO and Bl exons and merge at the 
beginning of exon 2, the proteins encoded by the two 
transcription units differ in the amino-terminal region 
and are the same in the carboxy-terminal region. The 
specific amino-terminal regions contain 266 and 423 amino 
acid residues in the E75 A and B proteins, respectively, 
while their common carboxy-terminal region consists of 
971 residues. The predicted molecular weights of the A 
and B proteins are thus 132,000 : and 151,000. The open 
reading frames display characteristic fi. melanoaaster 
codon usage, and their extents have been confirmed by in 
vitro translation of mRNAs transcribed An vitro from cDNA 
constructs and by expression of fusion proteins in 
£• fioli. The predicted protein sequence for each protein 
is punctuated by homopolymeric tracts of amino acids, 
which are noted in Table 1 and its legend. 

Analysis of the sequences of E75 proteins and 
comparison to the sequences of known proteins have 
revealed similarity between the E75 proteins and members 
of the steroid receptor superfamily (Evans, R. M. , 1988. 
The steroid and thyroid hormone receptor superfamily. 
Science 240:889-895;- Green, S., and P. Chambon, 1988. 
Nuclear receptors enhance our understanding of 
transcription regulation . Trends in g«ii*ir« 4 : 3 09-3 14 ) . 
We have used the nomenclature of Krust el al. (Krust, A., 
S. Green, P. Argos, v. Kumar, P. Walter, J. Bornert, and 
P. Chambon, 1986. The chicken oestrogen receptor 
sequence: Homology with v-erbA and the human oestrogen 
and glucocorticoid receptors. EMBQ J. 5:891-897) in 
dividing the proteins into six regions, A to F, in the 
amino- to carboxy-terminal direction. 

Similarity between E75A and other members of this 
superfamily is strongest in the c region, a cysteine- 
lysine-arginine-rich region that is necessary and 
sufficient for the binding of these receptors to DNA (for 
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review, see, Evans, R. M. , 1988. The steroid and thyroid 
hormone receptor superf amily . Science 240:889-895; 
Green, S., and P. Chambon, 1988. Nuclear receptors 
enhance our understanding of transcription regulation. 
Trends in Genetics 4:309-314). The C region consists of 
66-68 amino acids, of which 20 residues are invariant 
within this family. Among these are nine invariant 
cysteine residues, eight are believed to coordinate zinc 
in the formation of two zinc finger-like structures 
(Miller, J. , A. D. McLachlan, and A. Klug, 1985. 
Representative zinc-binding domains in the protein 
transcription factor IIIA from Eenopus oocytes. gMBO J. 
4:1609-1614; Freedman, L. P., B. F. Luisi, Z. R. Korszun, 
R. Basavappa, P. B. Sigler, and K. R. Yamamoto, 1988. 
The function and structure of the metal coordination 
sites within the glucocorticoid receptor DKA binding 
domain. Nature 334:543-546; Severne, Y. , S. Wieland, 
W. Schaffner, and S. Rusconi, 1988. Metal binding finger 
structure of the glucocorticoid receptor defined by site- 
directed mutagenesis . EMBO J. 9 : 2503-2508 ) . Within the 
C region, E75A contains all of the highly conserved 
residues and is approximately as closely related to other 
members of the steroid receptor superfamily as they are 
to one another. The closest relative of E75 appears to 
be the human ^ar-1 gene, which has nearly 80% amino acid 
identity to E75 A in the DNA-binding domain. 

The other region conserved among members of the 
steroid receptor superfamily is the E region, which is 
required for steroid binding and for the linkage of 
steroid-binding and t^ans-activation functions (for 
review, see, Evans, R. M. , 1988. The steroid and thyroid 
hormone receptor superfamily. Sciepce 240:889-895; 
Green, S., and P. Chambon, 1988. Nuclear receptors 
enhance our understanding of transcription regulation. 
T pp nrt 5 in Genetics 4:309-314). Although overall E-region 
similarity is clearly significant for the comparison of 
E75 A to the thyroid hormone, vitamin D, and retinoic 
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acid receptors, and ear-1, similarity to the 
glucocorticoid and estrogen receptors is considerably 
lower. However, the plots of local similarities show a 
clear similarity to each of these proteins within three 
5 subregions of the E region, denoted El, E2 and E3. The 

El subregion is the most highly conserved and corresponds 
to a region shown by in vitro mutagenesis to be essential 
for steroid binding and steroid-dependent trans - 
activation (Giguere, V., S. M. Hollenberg, 

10 M - G * Rosenfield, and R. M. Evans, 1986 . Functional 

> domains of the human glucocorticoid receptor, cell 

46:645-652; Danielson, M. , J. p. Northrop, J. Jonklaas, 
and G. M. Ringold, 1987. Domains of the glucocorticoid 
receptor involved in specific and nonspecific 

15 deoxyribonucleic acid binding, hormone activation and 

transcriptional enhancement. Mol. Endocrinol. 1:816- 
822). Region E2 is less highly conserved in primary 
amino acid sequence, but can, in part, be seen as a 
conserved hydrophobic region in the hydropathy plots of 

20 several* of these proteins. A deletion of 14 amino acids 

within this region abolished steroid binding 
(Rusconi, s., and K. R. Yamamoto, 1987. Functional 
dissection of the hormone and DNA binding activities of 
the glucocorticoid receptor. EMBO J. 6:1309-1315). E3 

25 falls close to the end of the region that is absolutely 

required for steroid binding. 

While the characteristic structural features of the 
steroid receptor superfamily are well conserved in E75 . 
two novel variations are seen. The first of these 

30 concerns the structure of the E75 B protein, which 

contains a major alteration within its putative DNA- 
binding domain. The steroid receptor superfamily DNA- 
binding domain consists of two DNA-binding zinc fingers 
separated by a less conserved linker region. In £75, as 

35 in nearly all other genes of this family, an intron is 

found between the two fingers, in £75, this splice marks 
the beginning of the region held in common between the 
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E75 A and B proteins. This results in the E75 A protein 
having two fingers, while the E75 B protein has unrelated 
B-specific sequences in place of the first finger. Other 
sequences within the B-specific amino-proximal region may 
5 contribute to the DNA-binding domain of the E74B protein. 

Alternatively, the B protein might bind DNA with 
only one finger, as GAL4 transcription factor of yeast 
appears to do. It is possible that these structural 
differences imply a functional difference in the DNA- 

10 binding properties of the E75 A and B proteins that might 

allow them to differentially regulate the transcription 
of the late genes that characterize the secondary 
response to ecdysone in different target tissues. 

In this respect, it should be emphasized that the 

15 putative hormone- or ligand-binding domain is represented 

by the E region that is common to the E75A and E75B 
proteins- Thus, these proteins appear to be receptors 
for the same hormone, which may regulate the 
transcription of different sets of genes. These proteins 

20 represent "orphan" receptors, in that their hormone, or 

binding ligand, has not yet been identified. Because 
ecdysteroids are the only known steroid hormones in 
Drosophila . the most obvious candidate for an E75 ligand 
would be ecdysone itself. However, it is unlikely that 

25 this is the case, since the putative hormone-binding 

domain of the E75 proteins does not exhibit the high 
sequence homology to that of the known Drosophila 
ecdysone receptor encoded by the EcR gene (see 
Experimental Example III and Table 2) that would be 

30 expected if the E75 proteins were also ecdysone 

receptors. It, therefore, seems likely that the E75 
proteins would bind either a terpenoid juvenile hormone 
or a novel Drosophila hormone. 

The second unusual feature of the E75 proteins is 

35 the presence of a large F region, encompassing nearly one 

half of the proteins. Many of the other receptors have 
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very small F regions, and no function has yet been 
ascribed to this region. 

Methods 

5 Protein sequence analysis 

Sequence data were compiled using the Bionet system. 
Protein sequence comparison was performed using FASTP 
(Lipman, D. j., and w. r. Pearson, 1985. Rapid and 
sensitive protein similarity searches, gclgnss 227:1435- 
10 1441) and Bionet IFIND programs. 

E - Expressi on Vectors for E75 Proteins 
In order to express the E75 proteins, portions of 
cONAs and genomic clones were fused in order to generate 
cassettes containing the entire E75 A and E75 B protein 
coding regions. First, BamHI sites were introduced into 
genomic clones upstream of the initial AUGs of the large 
open reading frames. Then, E75 AO exon sequences were 
fused to sequences of a nearly full-length £75 A cDMA, 
20 and E75 Bl exon sequences were fused to sequences of a 

nearly full-length E75 B cDNA. These cassettes were 
cloned into pGEM3 (Promega) , and transcripts of the open 
reading frames were prepared using T7 polymerase. These 
were then translated in the presence of -methionine, 
and shown to give rise to proteins of appropriate size. 

These cassettes have been placed into a variety of 
expression vectors, including pUCHsneo/Act for expression 
in prosoph ; Ua cells, pSV2 for expression in mammalian 
cells, and pOTS for expression in bacterial cells. 
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Methods 

BamHI sites were introduced directly upstream of the 
initial ATGs of the E75A. and E75B coding sequence — at 
the Sspl site upstream of the E75A initial ATG, and at 
35 the Sacll site upstream of the E75B initial ATG. cDNA 

and genomic sequences were joined at the EcoRV site in 
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the AO exon to construct an E75A cassette, and at the 
Mlul in exon 3 to construct an E75B cassette. 

EXAMPLE II 

5 

CLONING, STRUCTURE AND EXPRESSION OF THE EcR AND 
DHR3 GENES THAT ENCODE ADDITIONAL MEMBERS OF THE 
STEROID RECEPTOR SUPERFAMILY . 

10 The following experiments were carried out after the 

primary structure of the E75 gene, and of the two members 
of the steroid receptor superfamily that it encodes, was 
determined (Experimental, Example I). The purpose of 
these experiments was to clone and determine the primary 

!5 structure of other steroid receptor superfamily genes 

from Drosophila , and of the proteins they encode. The 
aim was to identify the gene that encodes a prosophijla 
ecdysone receptor, given that the characteristics of the 
E75 gene indicated that it did not encode an ecdysone 

20 receptor. The first stage of the experimental plan was 

to use £he conserved sequences in the E75A transcription 
unit that encode the putative DNA-binding domain of the 
E75A receptor protein as a probe to screen a PyosopfrjU^ 
genomic library to identify sequences encoding the 

25 putative DNA-binding domains of other Drosophila members 

of the steroid receptor superfamily. The second stage 
was to isolate cDNA clones corresponding to the 
identified genes, as well as additional genomic DNA 
clones, to obtain the nucleotide sequence of the complete 

30 coding region (i.e., the open reading frame encoding the 

respective receptors) and the exon-intron organization of 
these genes. 

The experiments described below resulted in the 
cloning and structural characterization of two genes that 

35 satisfy the criteria for bona fide members of the steroid 

receptor superfamily: encoding proteins that exhibit 
amino acid sequence homology to both the DNA-binding and 
the hormone-binding domains that are conserved among 
members of this superfamily. The two genes are called 
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£cB and DHBI. The £sE gene was originally called DHR23, 
but was renamed £sE after it was shown to encode an 
ecdysone receptor (see Experimental Example III) . The 
DHR3 designation stands for Drosophila Hormone 
Receptor 3, 

A, Identification and Chromosomal Mapping of EcR 
and DHR3 Genomic Clones 

Initially, Southern blots of total Drosophila 
genomic DNA, digested with one or more restriction 
endonucleases, were probed with a 530 bp fragment of the 

cDNA containing the sequences encoding the putative 
DNA-binding domain of the E75A receptor protein (see 
Experimental Example I) at low and high stringency 
hybridization conditions. 

To isolate the sequences responsible for these low 
stringency bands, this E75A probe was used to screen a 
Drosophil.a genomic library under the same low stringency 
conditions, counterscreening duplicate filters with E75 
intron probes to eliminate phage containing inserts from 
the £25 gene. Five genome equivalents were screened and 
39 non-E75 containing phage were isolated. The 25 most 
strongly hybridizing clones were divided into six classes 
on the basis of restriction patterns and cross 
hybridization, each class containing between one and six 
independent overlapping genomic inserts . 

For each class, a restriction fragment containing 
the region of hybridization to the E75A probe was 
localized by Southern blotting. Hybridization of probes 
derived from these fragments to genomic Southern blots 
showed that each of the low stringency bands detectable 
by the E75A probe could be accounted for by one of the 
six isolated fragments. 

The nucleotide sequences of the six restriction 
fragments were determined to test whether they represent 
candidate receptor genes. In all cases, DNA sequence 
similarities with the E75A probe were observed that are 
sufficient to account for the hybridization of these 
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fragments with the probe. When the DNA sequences were 
conceptually translated in all six reading frames, four 
of the fragments yielded no significant sequence 
similarity with E75A at the protein level. The remaining 
two clones , however, showed predicted amino acid 
sequences with strong similarity to the DNA binding 
domains of the E75A protein and other steroid superfamily 
receptors . 

These two clones represent the EcR and DHR3 genes, 
as will become apparent* Probes from these clones were 
used to map the position of these genes in the polytene 
chromosomes by in situ hybridization. The EcR and EHE3. 
chromosomal loci were mapped to positions 42A and 46F, 
respectively, in the right arm of the second chromosome. 

B. Structure of the EcR and DHR3 Genes and Their 
SSMs - 

The DHR3 and EcR genomic clones described above were 

20 used to ^screen a cDNA library prepared from third instar 

tissues treated with ecdysone and cycloheximide . This 
procedure allowed the isolation of a large number of cDNA 
clones, since both genes have a peak period of 
transcription in late third instar after the rise in 

25 ecdysone titer. For each gene, 20 cloned cDNAs were 

purified and their lengths determined. Restriction maps 
for the 10 longest cDNAs from each gene were determined 
and found to be colinear. 

For EcR, a 5534 bp cDNA sequence was obtained from 

30 two overlapping cDNA clones. It contains an 878 codon 

open reading frame (ORF) which yields a predicted amino 
acid sequence expected for a member of the steroid 
receptor superfamily (Table 2), as described in more 
detail below. The length of the largest EB£3 cDNA that 

35 was isolated (clone DHR3-9) is 4.2 Kb. The nucleotide 

sequence of this cDNA was determined and found to contain 
a 487 codon AUG-initiated open reading frame (Table 3). 
As described below, the amino acid sequence of the DHR3 
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protein predicted from this sequence demonstrates that 
this protein is also a bon^ fide member of the steroid 
receptor superfamily. 
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Table 2. The cDNA sequence of the EcR gene. 
Numerals at the left refer to the nucleotide sequences; 
those on the right to the amino acid sequence in the EcR 
5 protein. Nucleotides 1-5194 are the sequence of EcR-17 

cDNA, while nucleotides 5195-5534 derive from the EcR-9 
cDNA. The underlined sequences in the 5 1 and 3' 
untranslated regions refer, respectively, to the ATG 
codons and the AATAAA consensus polyadenylation signals. 

xo Positions of the introns and the donor and acceptor 

splice sequences are indicated above the cDNA sequence in 
small type. The amino acid sequences homologous to the 
conceived DNA-binding (C region) and hormone-binding (E 
region) domains of the steroid receptor superfamily are 

15 underlined. 

Panel 1 is shown in detail in panels 2 and 3. 
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Table 3, The cDNA sequence of the DHR3 gene. 
The numbering and underlining of the nucleotide and amino 
acid sequences have the same meaning as in Table 2, and 
the intron positions and donor and acceptor splice 
sequences are similarly indicated. The sequence of the 
5' proximal 2338 nucleotides of the DHR3-9 cDNA is shown. 
The sequence of the remainder of this 4.2 kb cDNA was 
determined for only one strand and is not shown. Four 
silent, third-position differences between the cDNA and 
genomic DNA sequences are indicated above the cDNA 
sequence. 
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The genomic structure of the EcR and DHR3 genes 
was investigated by isolating additional genomic DNA 
clones that form overlapping sets that contain all of the 
sequences found in the respective cDNA clones. The exons 
5 contained in these cDNAs were mapped within the genomic 

DNA by comparison of cDNA and genomic clones via Southern 
blot analysis, mapping of restriction cleavage sites, and 
finally, by determination of the nucleotide sequence of 
the genomic DNA in regions that contain the exon/intron 
10 boundaries. Table 2 and 3 show these boundaries and the 

sequence of the splice junctions for the EcR and DHR3 
genes, respectively. All of these splice junctions 
conform to the splice donor and acceptor consensus 
sequences . 

15 For EcR , the cDNA sequence shown in Table 2 is split 

into six exons spread over 36 kb of genomic DNA, with the 
ORF beginning in the second exon and ending in the sixth. 
For DHR3, the cDNA sequence derives from nine exons 
spread over 18 kb, with the ORF beginning in the first 

20 exon and ending in the ninth. Because the 5' and 3» ends 

of the respective mRNAs were not mapped, it should be 
emphasized that these genes may have additional noncoding 
exons at their 5 ' or 3' ends . 

The EcR and DHR3 gene structures differ 

25 significantly from those of all previously examined 

steroid receptor superfamily genes. Comparison with the 
genes for 11 other receptor homologues for which at least 
partial structural information is available reveals that 
the positions of certain exon boundaries have been 

3 0 conserved in evolution. This conservation is most 

striking in the portion of the genes encoding DNA-binding 
domains. In the nine other cases where the structure of 
this region has been examined, the two halves of the DNA- 
binding domain are always encoded by separate exons. If 

35 we exclude the Drosophila genes fcpirps, ?qurps-re:uted, 

and eaon (which are not bona fide receptor homologues 
since they lack the hormone-binding domain sequence 
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similarity) , these are always small exons, the second one 
invariably ending in the fourth codon beyond the 
conserved Met codon at the end of the C region. Thus, 
these exons each encode one of the two predicted zinc 
5 fingers of the DNA-binding domain. In contrast, both 

zinc fingers of the putative DNA-binding domain of the 
EcR and DHR3 receptors are encoded by a single exon. It 
is possible that our screen specifically selected for 
genes lacking the above intron. The screen selected 

10 genomic clones that hybridize to an E75A cDNA probe that r 

of course, lacks this intron. Genomic sequences 
containing a contiguous sequence encoding the DNA-binding 
domain would be expected to hybridize to this probe 
better than clones from genes containing the intron. 

15 This would explain the successful isolation of the EcR 

and EMI genes, and the failure to isolate the genes of 
other Drosophila members of the steroid receptor 
superfamily. 

20 Methods ) 

Isolation of cDNA and additional genomic clones 

Subclones of the originally isolated DHR3 and EcR 
genomic clones were used to screen a cDNA library 
prepared from third instar tissues treated with ecdysone 

25 and cycloheximide. This library was chosen because both 

genes are relatively highly expressed at the end of third 
instar, and because of the high quality of the library. 
Of the 270,000 primary plaques screened, 20 positives for 
PHR3 and 220 for ScR were detected. Twenty cDNAs for 

30 each gene were purified, of which the ten largest for 

each were restriction mapped and found to be colinear. 
cDNA DHR3-9, which extends further in both the 5' and 3 1 
directions than our other DHR3 cDNAs, was chosen for 
sequencing. For £cR, the longest cDNA, EcR-17, extended 

35 the farthest 5 1 and was sequenced in its entirety. An 

additional cDNA clone, EcR-9, was found to extend 300 bp 
farther 3 ' than EcR-17 , and this 3 1 extension was also 
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sequenced. Additional genomic DNA clones covering the 
EcR and DHR3 genes were obtained by screening the 
Drosophila Canton S genomic library referred to in part A 
above, either with probes from the respective cDNA 
5 clones, or, for overlapping clones, by the chromosomal 

walk method described in Experimental Example I. 

DNA sequence analysis 

cDNAs were subcloned into BlueScript vectors 
10 (Stratagene) , and clones for sequencing were generated by 

exonuclease III digestion (Henikoff, S., 1984. 
Unidirectional digestion with exonuclease III creates 
targeted breakpoints for DNA sequencing. Gene 
28:351-359) . 

15 Double-stranded plasmids were denatured (Gatermann, 

K. B., G. H. Rosenberg, and N. F. Kaufer, 1988. Double- 
stranded sequencing, using mini -prep plasmids, in 11 
hours. BioTechnioues 6:951-952) and sequenced by the 
dideoxy .chain terminating method (Sanger, F. , S. Nicklen, 

20 and A. R. Coulson, 1977. DNA sequencing with chain- 

terminating inhibitors. Proc. Natl - Acad. Sci. PSA 
74:5463-5467), using the enzyme Sequenase (U.S. 
Biochemical) . cDNA EcR-17 was completely sequenced on 
both strands, as was the EcR-9 3' extension. cDNA DHR3- 

25 9 was sequenced on both strands for the 5' most 2338 bp, 

which contains the entire ORF, and the remainder of the 
long 3' untranslated region was sequenced on one strand. 

The exon/intron boundaries in genomic DNA clones 
were first mapped at low resolution by Southern blot 

30 analysis of their restriction fragments probed with 

labeled cDNAs. Genomic DNA surrounding each exon/intron 
boundary was subcloned and the nucleotide sequence of 
these subclones determined as above. 

Genomic exons were either sequenced entirely, or for 

35 the longer exons, were digested and electrophoresed in 

parallel with cDNA clones to confirm the colinearity of 
the genomic and cDNA clones. Shorter exons were 
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completely sequenced from genomic clones. Longer exons 
had their boundaries sequenced from genomic clones, and 
were confirmed to be colinear with the cDNA clones by 
parallel digestion and electrophoresis of the cDNA and 
genomic clones. 

C. The Predicted Amino Acid Sequence of the EcR 
and . PER3 Proteins and their TTn P T i ea ti ong 

Comparison of the predicted EcR and DHR3 protein 
sequences to the sequence database and to individual 
members of the steroid receptor super-family shows that 
these proteins share the two conserved domains 
characteristic of this superfamily (Evans, R. M., 1988. 
The steroid and thyroid hormone receptor superfamily. 
gg * ence 240:889-895; Green, S., and P. Chambon, 1988. 
Nuclear receptors enhance our understanding of 
transcription regulation . Trends in r^^i^ 4 : 3 09-3 14 ) . 
We refer to the domains as the C and E regions, for the 
more am^no-terminal and more carboxy-terainal homologies, 
respectively, according to the nomenclature of Krust et 
al. (Krust, A., S. Green, P. Argos, V. Kumar, P. Walter, 
J. M. Bomert, and P. Chambon, 1986. The chicken 
oestrogen receptor sequence; homology with v-erbA and the 
human oestrogen and glucocorticoid receptors. EMBO j. 
5:891-897). These domains are underlined in Tables 2 and 
3, and Table 4A-C presents a comparison of these domains 
from EcR and DHR3 with those from representative members 
of the superfamily. 
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Table 4. Sequence comparison of the conserved 
C and E regions in DHR3, EcR, and some representative 
nuclear receptor homologues. (A) C-region alignment. 
5 Numbers at the left indicate the amino acid positions 

within the individual receptors; dashes indicate gaps 
introduced to obtain maximal alignment. Dots indicate 
three positions important in determining the DKA binding 
specificity of this domain. (B) E-region alignment. 

10 Bars indicate the three most highly conserved stretches 

within this domain. (C) Computed percent identities 
among the C-region sequences (lower left) and among the 
E-region sequences (upper right) . The kni sequence shows 
no significant E-region homology and is, therefore, not 

15 included in this comparison. Sequences shown are from: 

E75A, Drosophila ecdysone- inducible gene at 75B; kni, 
Drosophila segmentation gene knirps ; hRARa, human 
retinoic acid receptor alpha; htRB, human thyroid 
receptor beta; hVDR, human vitamin D receptor; cOUP-TF, 

20 chicken ovalbumin upstream promoter transcription factor; 

hERRl and hERR2 , human estrogen-related receptors 1 and 
2; HER, human estrogen receptor; hGR, human 
glucocorticoid receptor; hMR, human mineralocorticoid 
receptor; hGR, human progesterone receptor. 

25 Panel 1 is shown in detail in panels 2-6. 
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The C region is a 66-68 amino acid domain that 
has been shown to function as a zinc finger DNA binding 
domain in vertebrate receptors. This domain has also 
been implicated in receptor dimerization (Kumar, v., and 
5 P. Chambon, 1988. The estrogen receptor binds tightly to 

its responsive element as a ligand- induced homodimer. 
Cell 55:145-156). As shown in Table 4A, all 19 C-region 
residues that are absolutely conserved in the other 
receptor homologues are also conserved in DHR3 and EcR, 

10 including the nine invariant Cys residues, eight of which 

coordinate two zinc ions (Freedman, L. P., B. F. Luisi, 
Z. R. Korszun, R. Basavappa, P. B. Sigler, and 
K. R. Yamamoto, 1988. The function and structure of the 
metal coordination sites within the glucocorticoid 

15 receptor DNA binding domain, liature 334:543-546) . As 

seen in Table 4C r the Drosophila c-region sequences 
(including those of E75A) are not more closely related to 
each other than they are to those from the vertebrate 
receptor t homologues. The C region of DHR3 is most 

20 similar £o that of the human retinoic acid receptor a 

(hRARa) , and the C region of EcR is most similar to that 
of the human thyroid receptor B (hTRB) . Studies on the 
human glucocorticoid receptor (hGR) and human estrogen 
receptor (hER) have identified three C-region residues 

25 (indicated by dots in Table 4A) that are critical for 

determining the differential DNA binding specificity of 
these receptors (Mader, S., V. Kumar, H. de Verneuil, and 
P. Chambon, 1989. Three amino acids of the estrogen 
receptor are essential to its ability to distinguish an 

30 estrogen from a glucocorticoid-responsive element. 

Native 338:271-274; Umesono, K. , and R. M. Evans, 1989. 
Determinants of target gene specificity for 
steroid/thyroid hormone receptors. Cell 57:1139-46). 
The three Drosophila proteins DHR3, EcR, and E75A, as 

35 wbII as the vertebrate receptors hRARa, hTRB, and the 

human vitamin D receptor (hVDR) , all have identical amino 
acids at these three positions; thus, these proteins may 
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all have similar DKA binding specificities, as has 
already been shown for hRARa and hTRB (Umesono, K. , 
V. Giguere, C. K. Glass, M. G. Rosenfeld, and 
R. M. Evans, 1988, Retinoic acid and thyroid hormone 
5 induce gene expression through a common responsive 

element. Nature 336:262-265) . 

The E-region is an -225 amino acid domain that 
functions as a hormone-binding domain in vertebrate 
receptors. This domain has also been implicated in 
10 hormone dependent receptor dimerization (Kumar, V. and 

P. Chambon, 1988. The estrogen receptor binds tightly to 
its responsive element as a ligand-induced homodimer. 
Cell 55:145-156; Guiochon, M. A., H. Loosfelt, P. Lescop, 
S. Sar, M. Atger, A. M. Perrot, and E. Milgrom, 1989. 
15 Mechanisms of nuclear localization of the progesterone 

receptor: evidence for interaction between monomers. 
Cell 57:1147-1154), hormone dependent nuclear 
localization of the glucocorticoid receptor (Picard, D. , 
and K. fc. Yamamoto, 1987. Two signals mediate hormone- 
20 dependent nuclear localization of the glucocorticoid 

receptor. pffBO J. 6:3333-3340), and binding of the 
glucocorticoid receptor to the 90 kDa heat shock protein 
(Pratt, W. B., D. J. Jolly, D. V. Pratt, 
W. M. Hollenberg, V. Giguere, F. M. Cadepond, 
25 G. G. Schweizer, M. G. Catelli, R. M. Evans, and 

E. E. Baulieu, 1988. A region in the steroid binding 
domain determines formation of the non-DNA-binding, 9 S 
glucocorticoid receptor complex. J. Biol. Cham. 263:267- 
273). Table 4B shows an alignment of the E regions of 
the DHR3 and EcR proteins with those of other receptor 
homologues. The three relatively highly conserved 
stretches within this region noted in Experimental 
Example I are overlined; each contains a cluster of 
residues conserved in all or most of the receptor 
35 sequences. DHR3 and EcR show strong similarity to each 

other and to the other proteins in these stretches, and a 
lower similarity outside of them. The presence of this 
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E-region homology establishes these proteins as bona fide 
members of the nuclear receptor family, in contrast to 
the pyosopfrilq RTUyps (Nauber, U., K. J. Pankratz, 
A. Kienlin, E. Seifert, U. Klemm, and H. Jackie, 1988. 
Abdominal segmentation of the Drosophila embryo requires 
a hormone receptor-like protein encoded by the gap gene 
JgaiEps. Nature 336:489-492), kniros -relate ( O ro, A. E., 
E. S. Ong, J. S. Margolis, J. W. Posakony, M. McKeown, 
and R. M. Evans, 1988. The Drosophila gene kniras- 
relatefl is a member of the steroid-receptor gene 
superfamily. N^tuye 336:493-496), and ecron (Rothe, M. , 
U. Nauber, and H. Jackie, 1989. Three hormone receptor- 
like Drosophila genes encode an identical DNA-binding 
finger. EMBO J. 8:3087-3094) proteins, which show c- 
15 region homology but no E-region homology. The E region 

in DHR3 is most similar to that of E75A, and the E region 
of EcR is most similar to that of hTRfl, although the 
level of these similarities is lower than those found 
among E Regions of many other receptors (Table 4C) . 
20 Thus, DH&3 and EcR are not especially close homologues of 

any previously cloned receptors. Comparison of E-region 
sequences allows division of the nuclear receptors into 
subfamilies (Petkovich, M. , N. J. Brand, A. Krust, and 
P. Chambon, 1987. A human retinoic acid receptor which 
25 belongs to the family of nuclear receptors. Nature 

330:444-450), the members of any one subfamily being more 
related to each other than to those in other subfamilies. 
The DHR3 and EcR receptors fall into a subfamily with the 
E75A, E75B, hRARa, hTRB, and hVDR receptors. 
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D. In Situ Labeling of the EcR and DHR3 Proteins 

with Antibodies Induced by Proteins Produced in 
E. coli __ 

35 To determine the intracellular and tissue 

distribution of the EcR and DHR3 proteins in Drosop hila r 
affinity-purified polyclonal antibodies directed against 
those proteins were produced in the following manner. 
The region of about 120 amino acid residues located 
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between the conserved DNA-binding and hormone-binding 
domains of these proteins was used as the immunogen to 
produce antibodies against each protein. Thus, the 
coding sequences for amino acids 335-447 of the EcR 
5 protein and for amino acids 164-289 of the DHR3 protein 

(see Tables 2 and 3, respectively) were cloned into the 
appropriate pATH (Dieckmann, C, and A. Tzagaloff, 1985. 
J. Biol. Chem. 260:1513-1520) or pUR expression vectors, 
so as to fuse these coding sequences to those encoding £. 

10 coli B-galactosidase (Bgal) or to E. coli tryptophan E 

protein (trpE) , respectively. 

The Bgal fusion proteins were produced in £. coli by 
the addition of the IPTG inducer to exponential cultures, 
while the production of trpE fusion proteins were induced 

15 by dilution into tryptophan-free media and subsequent 

addition of indoleacetic acid. For EcR, the trpE fusion 
protein was used as an immunogen and the Bgal fusion 
protein was used on immunoblots to test sera for 
immunoreactivity to the EcR portion of the fusions. For 

20 DHR3, the Bgal fusion protein was injected, and sera were 

checked against the trpE fusion protein. 

For immunization the appropriate fusion protein was 
prepared by electrophoresis on SDS-PAGE gels and 
visualized by staining in ice-cold 0.25 M KC1, after 

25 which the fusion protein band was cut out. Approximately 

100 of fusion protein in 0.25 ml of gel slice was 
crushed by passing through successively smaller 
hypodermic needles, and mixed with 0.25 ml of a sterile 
saline solution and 0.5 ml of Freund's complete adjuvant. 

30 For each immunogen, two New Zealand White rabbits were 

injected at multiple intramuscular sites, and after one 
month, boosted at two-week intervals, omitting the 
Freund's adjuvant. While the Bgal fusion proteins were 
subject to the above gel electrophoresis without prior 

35 purification, the trpE fusion proteins were first 

purified by the following method which takes advantage of 
their insolubility in vivo . 
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E* Sfili from a two-liter culture of induced cells 
were washed, and the cell pellet was subjected to several 
freeze/thaw cycles. The cells were resuspended in 18 ml 
of 50 mM Tris-HCl, pH 7.5, 0.5 mM EDTA, and 1.8 ml of 10 
mg/ml lysozyme was added. After 15 minutes on ice, the 
cells were lysed by passing three times through a french 
pressure cell at 10,000 psi. The insoluble fraction was 
collected by centrifugation at 27,000 x g for 15 minutes, 
and washed by resuspension, using a Dounce homogenizer, 
in ice-cold 50 mM Tris-HCl, 0.5 mM EDTA, 0.3 M NaCl, 
followed by centrifugation as above. The washing step 
was repeated, and the final pellet dissolved in 10 ml of 
4M Urea, 2% (w/v) SDS, 50 mM Tris-HCl, pH 7.5, 1 mM EDTA, 
5% (v/v) 2-mercaptoethanol. Material remaining insoluble 
15 was centrifuged out and discarded. 

The antisera was affinity purified in a two-step 
procedure by successive passage through "nonspecific" and 
"specific" affinity columns. In the case of antibodies 
raised against the trpE fusion proteins, the nonspecific 
20 column consisted of resin coupled to the insoluble 

protein derived from £• coli expressing unmodified trpE 
protein, and was used to remove antibodies directed 
against trpE epitopes, as well as against insoluble E. 
coli protein impurities. The specific column consisted 
of resin coupled to the EcR-trpE fusion protein (purified 
as described above) and was used to absorb the desired 
antibodies directed against the EcR epitopes, antibodies 
that were subsequently released from the column. In the 
case of antibodies raised against the Bgal fusion 
proteins, the same general procedure was used, except 
that the resin in the nonspecific column was coupled to 
B-galactosidase, while that in the specific column was 
coupled to the DHR3-6gal fusion protein. Western blot 
analysis of the appropriate £. coli extracts demonstrated 
that these affinity-purified antibodies exhibited the 
desired specificity. 
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The intracellular distribution of the EcR protein in 
late third instar salivary glands was examined by in situ 
labeling of this protein with the anti-EcR antibody. The 
EcR protein was thereby shown to be highly localized in 
the nuclei. of these glands. Indeed, when the polytene 
chromosomes in these nuclei were examined by the 
antibody-labeling method of Zink and Paro (Zinc, B., and 
R. Paro, 1989. Nature 337:468-471), specific loci within 
these chromosomes exhibited strong binding of the EcR 
protein. In particular, the EcR protein was bound to the 
early puff loci, including those occupied by the £25 and 
£74 genes. This is the result expected if the ecdysone 
receptor encoded by the EcR gene is that which induces 
the transcription of the early genes, as anticipated by 
15 the Ashburner model. Another prediction of the Ashburner 

model is that the ecdysone-receptor complex initially 
represses the genes responsible for the later puff, so 
that the transcription of the late genes induced by the 
early gene proteins is delayed until these proteins 
20 accumulate sufficiently to overcome this initial 

repression. If the EcR receptor is involved in this 
postulated initial repression, then one would expect the 
EcR protein to bind to the late puff loci in the salivary 
glands. This expectation was met by the observation that 
25 EcR protein also binds to the late puff loci in the 

polytene chromosomes. 

Additional in situ antibody labeling experiments 
demonstrated that the EcR protein is present in the 
nuclei of all ecdysone target tissues examined in late 
30 third instar larvae. It is also present in most, if not 

all, cells during embryogenesis and other stages of 
Drosophila development that have been examined. In this 
respect, the EcR protein was not detected by anti-EcR 
antibody labeling of embryos in which the EcR gene was 
35 eliminated by a chromosomal deletion, further 

demonstrating the specificity of this antibody. 
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In contrast to the widespread distribution of the 
EcR protein, anti-DHR3 antibody labeling of embryos 
demonstrated that the distribution of the DHR3 protein is 
highly restricted during this stage of development. 
During the brief embryonic period of expression, the 
protein is restricted to the peripheral nervous system, 
and to cells surrounding the spiracles at the posterior 
end of the embryo. 

Finally, it should be noted that affinity-purified 
antibodies against the E75A protein have also been 
prepared by the same technique described above for anti- 
EcR and anti-DHR3 antibodies, m situ antibody labeling 
of the E75A protein in larval salivary glands has also 
demonstrated that this protein is localized in the 
nucleus and is bound to specific loci in the polytene 
chr omo s omes . 

EXAMPLE TTT; 

THE ECDYSTEROITKBINDING , DNA- BINDING AND GENETIC 
REGqliATORY PROPERTIES OF THE EcR PROTEIN DEMONSTRATE 
THAT IT IS AN ECDYSONE RECEPTOR. 

The following experiments demonstrate that the 
protein encoded by the Ec£ gene is an ecdysone receptor 
by the following three criteria. (l) The EcR protein 
binds ecdy steroids and accounts for a large proportion, 
if not all, of the ecdysteroid-binding activity present 
in Dyosopnil^ embryos and in a variety of cultured 
prosgpfr i la cells. (2) The EcR protein binds with high 
specificity to a DNA sequence that functions as an 
ecdysone response element (EcRE) , i.e., an enhancer that 
confers ecdysone inducibility to a promoter. (3) Cells 
that do not respond to ecdysone because they lack 
functional ecdysone receptors are transformed to the 
ecdysone-responsive state by transfection with an EcR 
expression plasmid. 

A - The EcR Protein Bli nds Ecdy sterm'Hg 
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The EcR expression plasmid, pMTEcR, shown in 
Figure 1 contains the open reading frame encoding the EcR 
protein ( EcR ORF; see Experimental Example II) fused to 
the prosophila metallothionine promoter (Pmt) at its 5 % 
5 end, and the polyadenylation-cleavage sequences of the 

Prosophila Actin 5£ gene at its 3* end. Because 
transcription of the £cB ORF is under the control of this 
metallothionine, that transcription is induced by Cu 2+ ion 
to yield an mRNA that, in turn, produces the EcR protein. 

10 A C eii line, MtEcRHy, that overproduces this protein upon 

Cu 2+ induction, as determined by Western blot analysis 
using the affinity-purified anti-EcR antibody (see 
Experimental Example II) , was constructed by the stable 
integration of the pMTEcR plasmid DNA into the genome of 

15 Prosophila Sch-2 cell line. A control cell line, MtHy, 

was similarly constructed by the integration of the 
expression vector DNA lacking the £cR ORF. 

Whole cell extracts were prepared from both the 
MtEcRHy and MtHy cell lines after Cu 2+ induction, and were 

20 assayed for ecdysteroid-binding activity using the high 

affinity ecdysone analogue l^ 25 !} iodoponasterone A. The 
MtEcRHy extract contained sevenfold more saturable 
ecdysteroid-binding activity than the MtHy control 
extract. 

25 To see if the induced ecdysteroid-binding activity 

was due to the EcR polypeptide itself, the EcR protein 
was depleted from the MtEcRHy extract by 
immunoprecipitation using an affinity-purified anti-EcR 
polyclonal antibody, or, as a control, the extract was 

30 mock-depleted with preimmune serum. The treated extracts 

were then assayed for ecdysteroid-binding activity. 
Comparison of the immuno-depleted extract with the mock- 
depleted extract showed that most of the binding activity 
was removed by the anti-EcR antibody treatment, 

35 indicating that the induced ecdysteroid-binding activity 

results from the EcR protein. 
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The endogenous ecdysteroid-binding activity in the 
control cell line, MtHy, was unchanged by Cu 2+ exposure, 
and was approximately the same as that in the Sch-2 cell 
from which it derives. The question arises as to whether 
the endogenous activity in these and other Drosoohila 
cell lines, as well as in embryonic extracts, results 
from the expression of the £sS gene in their respective 
genomes. To answer this question, extracts from embryos 
and several cell lines were immuno-depleted and mock- 
depleted, as described above, and assayed for 
ecdysteroid-binding activity. Again, comparison of these 
treated extracts showed that the large majority of the 
endogenous binding activity was removed in each case by 
treatment with the anti-EcR antibody. Thus, it appears 
that most, if not all, of the endogenous binding activity 
in embryos and cell lines results form the resident EcR 
gene. 

Kethods 
Extracts * 

Tissue culture cell extracts for hormone and DNA- 
binding experiments were prepared as follows. Cells were 
grown in spinner flasks to a density of 5-7xl0 6 cells/ml, 
and were washed once in EcR buffer (25 mM Hepes, pH 7.0, 
40 mM KC1, 10% (v/v) glycerol, 1 mM EDTA, 1 mM 
dithiothreitol, and the following cocktail of protease 
inhibitors: 10 mM Na 2 S20 5 , 500 /iM PMSF, 1 }M leupeptin, 
1 MM pepstatin). All further manipulations were at 4*C. 
Cells were resuspended in EcR buffer at 2% of the 
original culture volume, divided into 3 ml aliquots, and 
sonicated using 30 1/2 second pulses with a probe 
sonicator (Bronson Sonifier 450) , resulting in 
disruptions of -95% of the cells. After centrifugation 
at 100,000 x g for 1 hour, 100 pi aliquots of supernatant 
were frozen in liquid nitrogen, and stored at -80 'C. 
Protein concentration was determined using bovine serum 
albumin as the standard, and was typically 6-11 mg/ml. 
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Embryo extracts were prepared by a similar protocol: 
3-6 hour Canton S embryos were dechorionated in 55% 
commercial bleach for 2 minutes, washed extensively in 
0.7% NaCl, and resuspended using 2 grams of embryos per 
5 ml of EcR buffer. Embryos were broken with 20 strokes in 

a Dounce homogenizer using a B pestle, and lysis was 
completed with the probe sonicator using the same 
settings as used for the tissue culture cells. The 
extract was adjusted to 400 mM KC1, centrifuged 1 hour at 
10 100,000 x g, and aliquots of supernatant were frozen. 

This extract contained 13.4 mg/ml protein. Before use in 
hormone binding, it was diluted tenfold in EcR buffer 
lacking KCl to bring the final KC1 concentration to 
40 mM. 

15 

Hormone-b inding assays 

For hormone-binding experiments, extracts were first 
diluted to the following concentrations in EcR buffer: 
0.9 mg/ml for MtHy and MtEcRHy extracts, 3 mg/ml for S2 
20 and SRS i.5 extracts, 4 mg/ml for the Kc cell extracts, 

and 1.3 mg/ml for the embryo extract. All manipulations 
were done on duplicate samples in order to quantify 
variability in the results. For immunoprecipitation 
experiments, extracts" were immuno-depleted , mock- 
25 depleted, or left untreated. For depletions, 300 Ml of 

diluted extract was incubated for 30 minutes at 25'C with 
3.5 Ml affinity-purified anti-EcR antibody, or with 
3.5 Ml preimmune serum for the mock-depletion control. 
Then 38 Ml 10* fifanhvloco r.cus aureus (Pansorbin, 
Calbiochem) in EcR buffer was added, and incubation was 
continued for 15 minutes at 25 'C. After centrifugation 
for 3 minutes in a microcentrifuge, the supernatant 
(depleted extract) was recovered. The 
immunoprecipitation was repeated, except in the case of 
35 the embryo extract, which was subjected to only one round 

of precipitation. The "untreated" extract aliquots were 
left at 4*C for the duration of the depletion procedure, 
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and were diluted with EcR buffer to match the final 
concentration of the depleted aliquots. 

A modification of the hormone-binding assay of P. 
Cherbas was used (Cherbas f P. 1988. Proc. Nat'l Acad. 
Sci., U.S.A. 85:2096-2100)* Assay tubes contained 140 pi 
extract, 14 /il [^I] iodoponasterone, and either 14 pi 
EcR buffer or 14 pi unlabelled 20-OH ecdysone in EcR 
buffer as a competitor. [ 125 I] iodoponasterone was 2177 
Ci/mM and was used at a final concentration of SxlO' 10 M in 
the assay; 20-OH ecdysone was 2xl0* s M final concentration 
in the assay. After incubation for 1 hour at 25 *C, each 
reaction was spotted on a dry Whatman GF/C filter 
(2.4 cm), and after 30 seconds the filter was washed by 
using a vacuum to draw 10 ml EcR buffer through the 
filter over a period of 1 minute. Filters were placed in 
800 jil 4% SDS, and radioactivity was measured in a 7 
counter. The hormone-binding activities shown are 
saturable binding activities, calculated as the total 
binding activity, as measured in assays with no added 
competitor, minus the unsaturable binding activity, which 
was measured in the assays with excess unlabelled 
ecdysone added. In the most active extracts, the 
unsaturable activity (representing the large number of 
low affinity binding sites in the extract) was less than 
10% of the total activity. 

B. Genetic Regulatory Activity of the EcR Protein 



An ecdysone-inducible reporter plasmid, 
pEcRE/Adh/figal (Figure 2) , was constructed to test the 
regulatory functions of the EcR protein in vivo . The 
reporter gene in this plasmid consists of the sequence 
that encodes the E. coli B-galactosidase (flgal ORF) 
linked through the 5» leader sequence of the Drosoohila 
Ultrabithorax gene ( UBX leader and MS) to an ecdysone- 
inducible promoter. This promoter was created by fusing 
a truncated version of the proximal promoter for the 
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Drosophila Adh gene (P DA dh-34 + 53, the numbers indicating that 
it consists of the sequence from base pair positions -34 
to +53, which just includes the TATA box) to seven 
repeats of a 34 bp synthetic oligonucleotide (7 £cR£ 
5 oligos) which contains the ecdysone response element 

(EcRE) from the ecdysone-inducible heat shock gene hs£ 21 
(Riddihough and Pelham, 1987. EMBO 6:3729-3734), The 
seven EcREs should confer ecdysone-inducibility to the 
truncated promoter, provided that the cells transfected 

10 with this reporter plasmid contain the appropriate 

ecdysone receptor. 

This ecdysone-inducible reporter plasmid was 
constructed by insertion of the 7 EcRE OLIGOS into 
plasmid pAdh/Bgal, which is identical to pEcRE/Adh/Sgal 

15 except that it lacks the array of ecdysone response 

elements. The pAdh/Bgal plasmid should therefore not be 
ecdysone inducible and can serve as a control. To test 
these expectations, Sch-2 cultured cells (which were 
shown ab^ve to contain endogenous ecdysone-binding 

20 activity) were transfected with each plasmid and examined 

for 6-galactosidase activity in the presence and absence 
of ecdysone. The ecdysone-induced 6-galactosidase 
activity in the pEcRE/Adh/figal transfected cells was 
2000-fold greater than when such cells were not exposed 

25 to ecdysone, whereas ecdysone had little effect on the 

pAdh/Bgal transfected cells. These results indicate that 
the EcREs confer ecdysone-inducibility on the PnAdh-34+53 
promoter, as expected, and that the Sch-2 cells contain 
functional ecdysone receptors. 

30 xo test the function of the EcR receptor in such a 

system, host cells lacking functional ecdysone receptors 
are required. "Ecdysone-resistant" cells lacking 
ecdysone-binding activity, and hence, presumably 
functional receptors, can be produced by continuously 

35 exposing ecdysone-responsive cells to ecdysone during a 

period of several weeks. This ecdysone-resistant state 
is then maintained in ecdysone-free media for several 
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months. An ecdysone-resistant cell line, SRS 1.5, was 
therefore generated by growing Sch-2 cells in 3x10^ M 
ecdysone. The SRS 1.5 cells lack significant ecdysone- 
binding activity. 

When these cells were transfected with the 
pEcRE/Adh/6gal plasmid and subsequently exposed to 
ecdysone, very little ecdysone-induced B-galactosidase 
activity was observed, indicating that the cells have 
only trace amounts, if any, of functional receptors. To 
test whether the expression of the EsS gene can "rescue" 
this deficiency, the SRS 1.5 cells were cotransfected 
with two plasmids: the ecdysone-inducible reporter 
plasmid, PEcRE/Adh/ Bgal , and a constitutive expression 
plasmid for the Ecg gene, pActEcR, in which transcription 
of the ficg ORF is .controlled by the Drosophna Actin 
promoter, (Figure 3) . Cotransfection with these two 

plasmids, followed by exposure to ecdysone, resulted in a 
dramatic induction of fl-galactosidase activity. Thus, 
introduction of this Ecr expression plasmid into the 
20 SRS 1 - 5 cells regenerated the ecdysone-inducibility they 

had lost. 
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Methods 

Construction Of the pAdh/fltyal . pEcRE/Adh/ftc f al and BAnt-RrP 

Plasmid pAdh/flgal was constructed in two steps. The 
Bglll-Scal fragment of pDA5'-34, containing nucleotides - 
34 to +53 of the Drosophiia Adh. distal promoter, was 
cloned into pUC18 cut with Seal and BamHI. The resulting 
plasmid was cut with EcoRl, and the EcoRl fragment of 
cPBbxd6.2 (containing the Sfex. untranslated leader and 
AUG, the Bgal open reading frame, and the SV40 splice and 
poly A signals) inserted. 

To construct pEcRE/ Adh/ Bgal from pAdh/Bgal, two 34- 
residue oligonucleotides were synthesized: 

5 • TCGAGAGACAAGGGTTCAATGCACTTGTCCAATG3 1 

3 ' CTCTGTTCCCAAGTTACGTGAACAGGTTACAGCT5 • 
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These will anneal to form 30 bp duplexes with Sail 
compatible four nucleotide overhangs at their 5' ends, as 
shown. Further annealing via the 5' overhangs allows 
formation of tandem arrays that can be inserted into 
5 pAdh/Bgal at its Sail site just upstream from the TATA 

box of the truncated Adh promoter. When these 
oligonucleotides were kinased, annealed, ligated into 
Sail-cut pAdh/Bgal and cloned, pEcRE/Adh/Bgal was 
obtained. Restriction mapping showed that it contained a 
xo tandem array of seven 34 bp repeats, each of which 

contains the 23 bp ecdysone response element (EcRE) 
present in the bs£ 27. gene, the remaining 11 bp 
representing flanking fasja 27 sequences and the 5» 
overhangs . 

15 The constitutive EcR expression plasmid, pActEcR, 

was formed by inserting the Fspl-Hpal fragment of an EcR 
cDNA containing bp 851-4123 that contains the ORF 
encoding the EcR protein (Table 2) , into the £coRV site 
of the AatSV40BS plasmid. This expression vector was 

20 constructed in two steps by inserting the Xbal-EcoRl 

fragment of cosPneoB-gal, containing the SV40 splice and 
poly A signals, into BlueScript+KS (Stratagene) cut with 
SacII and Xbal, blunting the EcoRl and SacII ends. The 
resulting plasmid was digested with BamHl and Apal, and 

25 the BamHl -EcoRl fragment of pPAc was inserted, with the 

Apal and EcoRl ends being blunted. 

Tr^nsfection and generation of the cell line SRS 1.5 
The cell line SRS 1.5 was obtained by growing 

30 Schneider line 2 (Sch-2) cells in the presence of 3x10"* M 

20-OH ecdysone (Sigma) . This treatment initially halts 
growth of Sch-2 cells, but after several weeks the 
adapted cells grow well. SRS 1.5 cells were washed in 
hormone-free medium and passed several times in hormone- 

35 free medium prior to their use in trans fection 

experiments. Cells were transfected by the calcium 
phosphate technique. Cells were transfected with 10 pg 
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of each plasmid used? when only a single plasmid was 
being transfected, 10 of pUC18 DNA was added as a 
carrier. In general, all transfections were carried out 
in duplicate. Twenty- four hours after transfection, 
cells that were to undergo hormone treatment were split 
into two dishes, one of which was treated with 2X10" 6 M 
20-OH ecdysone. 

fi-aalactosidase assays 

Forty-eight hours after transfection, 2 ml of cells 
were washed once in PBS (137 mM NaCl, 27 mM KC1, 65 mM 
Na 2 HP0 4 , 15 mM KH 2 P0 4 , pH 6.8), and were resuspended in 
50 /il of 0.25 M sucrose, 10 mM Tris, pH 7.4, 10 mM EDTA, 
and repeatedly frozen in liquid nitrogen and thawed in a 
37 *C water bath for a total of 3 freeze/thaw cycles. 
Cell debris was removed by a 10-minute centrifugation in 
a microcentrifuge at 4'C. The concentration of protein 
in the supernatant (cell extract) was determined by the 
Bradford method, with bovine serum albumin as a standard, 
and was typically 1.5-2.5 mg/ml. Extracts were assayed 
immediately or frozen and assayed up to two weeks later 
with no loss in activity. To 10 /il of extract, or an 
appropriate dilution, 500 /il of assay buffer was added 
(0.6 mM 4-methylujnbelliferyl-B-D-galactoside, 60 mM 
Na 2 HP0 4 , 40 mM NaH 2 P0 4 , 10 mM KC1, 1.0 mM MgS0 4 , pH 7.0). 
After a 30-minute incubation at 37 *C, reactions were 
stopped with 500 /il of 300 mM glycine, 15 mM EDTA, 
pH 11.2. The fluorescent reaction product was quantified 
on a Perkin-Elmer LS-5B luminescence spectrometer, with 
Aot=365 nm and Aeni=450 run. Bgal activities are given as 
fluorescence units per /xg protein assayed. 

C. Specific Binding of the EcR Protein to Ecdysone 



The simplest explanation of the results described in 
the preceding section is that the EcR protein generated 
by the EcR expression plasmid binds to the EcRE of the 



Response Elements 
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reporter plasmid and, in combination with ecdysone, 
activates the minimal £d& promoter in that plasmid. The 
following experiment was designed to test whether the EcR 
protein exhibits specific binding to this EcRE in vi££o. 
5 Two plasmids were used: pUC18, which serves as the 

control, and pUC18-EcRE, which was generated by 
substituting the Hindll-Xbal fragment from pEcRE/Adh/Bgal 
that contains the seven repeats of the 34 bp EcRE 
oligonucleotide, for the Hindll-Xbal fragment of pUC18. 

10 Because the only difference between these two fragments 

is the seven oligonucleotide repeats, this is also the 
only difference between the two plasmids. 

The two plasmids were digested with ApaLI and Hind 
III, end-labeled with 32 P and mixed with an extract from 

15 MtEcRHy cells in which the EcR protein was overexpressed 

by Cu 2+ induction (see section A, above) . After a 
15-minute incubation at 25 'C to allow EcR-DNA binding to 
occur, affinity-purified anti-EcR antibody was added. 
The 25 *Cj incubation was continued for an additional 

20 40 minutes, at which time anti-rabbit Ig-coated magnetic 

beads (Dupont Magnasort-R) were added, and the incubation 
continued 15 minutes more. The beads were separated from 
the solution magnetically, similarly washed, and the DNA 
eluted from the beads in 1% SDS at 65 -C. The eluted DNA 

25 was ethanol precipitated and fractionated by 

electrophoresis in an agarose gel, which was dried and 
autoradiographed . 

Only the fragment containing the EcRE 
oligonucleotide was specifically and efficiently 

30 registered on the autoradiographs, and that registration 

was dependent upon the anti-EcR antibody. Quantitative 
analysis of the autoradiographs demonstrated a 10 3 -fold 
preference for binding to the EcRE oligonucleotide over 
the average vector sequences, under the conditions of 

35 this assay (see Methods, below) * 
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According to the criteria 
this Experimental Example, the 
satisfies the definition of an 



stated at the beginning of 
EcR protein clearly 
ecdysone receptor. 



Metfrofls 

Conditions for the DNA binding q MJ , Y 

A quantity of 0.2 fmole of digested, labelled 
plasmid DNA was mixed with 2 fig (dl/dC) in 10 /il of TE 
(10 mM Tris-HCl, pH 8.0, 1 mM EDTA) , and 90 /xl of the 
MtEcRHy extract, diluted to 0.9 mg/ml in EcR buffer 
adjusted to- 180 mM KC1, was added. After binding for 
15 minutes at 25 'C, 2 ml of affinity-purified anti-EcR 
antibody, diluted l.5x in EcR, was added, and this 
incubation was continued at 25 -C for 40 minutes, when 
50 Ml of anti-rabbit Ig-coated magnetic beads (Dupont 
Magnasort-R) , exchanged into 180 mM KC1 EcR buffer, was 
added and the incubation continued for 15 minutes. 

The beads were washed twice in 400 pi 180 ntM KC1 EcR 
buffer, and DNA was eluted from the beads by soaking 
twice in 200 jil 1% SDS in TE at 65 *C. The eluted DNA was 
ethanol precipitated and run on an agarose gel, which was 
dried and autoradiographed. As controls, one half of the 
input DNA (0,1 fmole) was run on the gel for comparison, 
and the binding assay was carried out, leaving out the 
antibody. 

EXAMPLE TV 
RECEPTOR GENE MUTAGENESIS. 

Mutations in the steroid receptor superfamily genes 
can alter their function in two ways. Most obviously, 
they alter the sequences encoding the receptor proteins 
and thus alter the receptor function. Alternatively, 
they can alter the expression of these genes — an 
alteration that can be at any level of that expression 
from transcription of the gene to the translation of its 
mRNA(s) . Such mutations can change the timing of gene 
expression during development or change the tissue and 
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cell distribution of that expression, thus, profoundly 
changing the course of development. Furthermore, these 
mutations provide information about the regulation of 
receptor gene expression, just as mutations that alter 
5 the structure of the receptors encoded by these genes 

provide information about the genes whose expression 
these receptor proteins control. In particular, 
mutations that alter receptor gene expression can lead to 
the identification of the proteins and other regulatory 

XO molecules that control their expression- Clearly, 

mutagenesis of insect steroid receptor superfamily genes 
provides an important avenue leading to an ability to 
interfere in a highly specific manner with insect 
development, and thus to control insect infestations 

15 deleterious to human health and agriculture. 

We have carried out mutagenesis experiments for two 
Drosophila members of the steroid receptor superfamily 
genes, E75 and EsB, that we have cloned and characterized 
with respect to their expression. In this experimental 

20 example, mutagenesis of the EZ5 g*n<* is described. 

A. Deletion Mutations 

In Drosophila , genetic analysis for a given locus - 
- in this case, the early puff locus at 75B that houses 

25 the £75 gene — generally depends upon the isolation of 

deletions of all or part of that locus. Such deletions 
greatly facilitate the subsequent isolation of point and 
other small mutations within the locus. By isolating 
mutations that are revertants to the neighboring dominant 

30 Wrinkled (W) mutations, we have isolated and molecularly 

mapped the boundaries within our chromosomal walk (see 
Experimental Example I) of two deletions, W* 4 and J?* 10 , 
generated by gamma ray mutagenesis, the preferred way of 
generating such large alterations of genomic structure. 

35 one of these, if 10 , extends distally from Wrjnkleq to 

cover the entire £75 gene; and the other, W R4 , extends to 
a point about 90 kb upstream of the 5» end of the 50 kb 
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E25A transcription unit and does not include the E75 
gene. 

An F2 screen was then employed to screen for gamma 
ray-induced mutations mapping to the 200 kb distal region 
5 that is included in the W* 10 deletion but not the E* 4 

deletion. This screen resulted in the isolation of five 
members of a single lethal complementation group that 
molecular mapping data demonstrate represents the US 
gene. The most useful of these five mutations is the 

10 £2S X48 mutation. Molecular mapping of this mutation 

demonstrated that it is a 105 kb region that includes all 
of the E75_ gene. This method provides an extremely 
efficient screen for other EZ5_ mutations, i.e., by 
screening for mutations that cannot complement this 

15 deletion mutation. 

B. E75 Mutations Generated by Ethyl Methane 
Sulfonate J 

The) chemical mutagen ethyl methane sulfonate (EMS) 
was used for this screen, as it is the preferred method 
for generating point or small mutations. An F2 screen of 
15,000 lines resulted in the isolation of 23 penetrant 
. mutations within the 105 kb region of the E75 X48 deletion, 
all of which turned out to be alleles of E75. it appears 
that this 105 kb region was saturated by this screen with 
respect to lethal complementation groups, and hence, £75. 
appears to be the only lethal complementation group in 
this region. Adding the five E75 mutations described 
above, a total of 28 penetrant E7£ alleles have thus been 
isolated, several of which are temperature-sensitive 
alleles . 

Inter, se complementation studies among these alleles 
and examination of their phenotypes reveal a complex 
complementation group ~ a complexity that probably 
results from the fact that the E75 gene contains two 
overlapping transcription units, a 50 kb E25A, unit and a . 
20 kb E75B unit that occupies the 3 ' end of the E2M unit 
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(see Experimental Example I and Table 1) . These alleles 
can be roughly divided into two groups: (1) those that 
cause lethality in early development, during the latter 
part of embryogenesis or during early larval development, 
and (2) those that cause lethality late in development, 
during the prepupal or pupal stages. 

This division correlates with the stages when the 
E75A and E75B units are expressed. Thus, E75A 
transcription is associated with each of the six pulses 
of ecdysone, including those that mark the embryonic and 
early larval stages. By contrast, EZ5B mRNAs are not 
observed until the end of the last larval stage, being 
particularly abundant during the pupal stage. This 
correlation invites the speculation that the early lethal 
mutations affect the expression of the ElSh unit and its 
E74A protein, and that the late lethal mutations 
specifically affect the expression of the fi75B unit and 
its E75B protein. This proposition can be tested by 
detailed ^molecular mapping of these mutations and further 
examination of their phenotypes at the molecular level to 
determine the causes of lethality. 

The mutants described here provide a foundation for 
the further genetic analysis of the £75 gene that will 
allow exploration of the requirements for appropriate E75 
expression and function and will identify structural and 
functional domains of EZ£. Some of the future 115 
studies will best be performed by its in vitEfi 
manipulation, followed by transformation of the 
constructs back into Drosonhila . Finally, it will be 
desirable to identify interacting genetic loci — 
interactions that may occur at the level of regulation of 
E75 expression or at the level of interaction of the E75 
proteins with those encoded by other genes. Such 
interactive genetic loci can be identified via the 
isolation of mutations that act as suppressors or 
enhancers of the E75 mutations. 
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Methods 

Strains , marker s and chromosomes 

For this aspect of the invention, the following 
strains, markers and chromosomes were used. Tu 2 was 
described by Lindsley (Lindsley, 1973. DIS 50:21). All 
other strains and mutations are as described (Lindsley, 
and Grell, 1968. Genetic Va riation of Drosa phii* 
p&lanQcfusteiz, Publication 627, Carnegie Institute of 
Washington, Washington, DC) . ru & W R4 e^ca was 
constructed by recombination between ru w R4 sbd 2 <gg and 
S£ e s ro ca. The at in ri E P M 2 chromosome was 

constructed by recombination of st jji ri p* with sbd 2 . in 
order to allow marking of this chromosome over W* 4 and 

RIO 

W , and homozygosed by crossing to TM3, backcrossing to 
TM3, and mating of isogeneic sibling progeny. The st p p 
ell line was homozygosed by standard ionic procedures. 
&Qt£ and ns are described in Scott et al. (1984) 
Proc. Nat'l Acad. Sni T nsa 81:4115-4119. The pupal 
lethals ^19, g26, 213B, Mil, iX-14 . 2612, M5, £4, o30L , 
SS416, 13B115 , 052 and wcr49 are described in Sheam 
(1974) Genetics 77:115-125. All strains used to 
construct the strains described above and other strains 
were obtained from the Bowling Green and Caltech stock 
centers . 

TMl, TM3 and TM6B (Lindsley, and Grell, 1968. 
Genetjc Variation of Drosoph i la melan Q q ag t_Ar f Publication 
627, Carnegie Institute of Washington, Washington, DC) 
are balancer chromosomes carrying recessive lethal 
mutations along with multiple inversions to suppress 
recombination. This allows the maintenance, as a 
heterozygote, of a recessive lethal chromosome in its 
original state. These chromosomes are also marked with 
convenient visible markers. 

Quantitative Sout hern blot mapping for detection of 
mutant lesions 



Prtnted flrwn Mfcnou 



WO 91/13167 PCT/US91/01189 



89 

DNA was prepared from adult flies (about 50) by 
douncing in 1 ml of 10 mM Tris-HCl, pH 7.5, 60 mM KaCl, 
10 mM EDTA, 0.15 mM spermine, 0.2 mg/ml proteinase K. 
The homogenate was added to an equal volume of 0.2 M 
5 Tris-HCl, pH 9.0, 30 mM EDTA, 2% SDS, 0.2 mg/ml 

proteinase K, incubated at 37 *C for 1 hour, and then 
extracted twice with buffer-saturated phenol and once 
with 24:1 chloroform/isoamyl alcohol. DNA was EtOH 
precipitated twice, hooking the pellet out without 

10 centrifugation. Southern blot hybridization was as 

described (Segraves, W. et al., 1984. J. Mol. Biol. 
175:1-17). Where restriction fragment length 
polymorphism was not used in order to distinguish the 
parental chromosome from the balancer chromosome, 

15 quantitation of band intensity on genomic Southerns was 

achieved using a scanning densitometer. By using a 
control probe outside the mutant region, the amount of 
DNA in each track was internally controlled. Comparison 
of deficiency heterozygote to wild type bands, when 

20 normalized to a control band in this way, gives little 

deviation from the expected 1:2 ratio. 

flolecular cloning of mutant lesions 

Restriction fragments of the appropriate size were 

25 isolated by preparative low melting agarose (FMC) 

electrophoresis of about 20 of restricted genomic DNA. 
The 6 kbsf 4 Xhol fragment was cloned into Xhol-cleaved 
ASE6DBam which is propagated as a plasmid in order to 
grow the vector and cannot be packaged without an insert. 

30 The 18 kb W mo Sail fragment was cloned into the Sail site 

of AEMBL3, cleaved also with EcoRI for the biochemical 
selection method for the prevention of propagation of 
non-recombinant clones. The 7 kb EcoRI fragment 
containing the }Q1 breakpoint was cloned into EcoRI - 

35 cleaved A607. Plating of recombinants on the htlh strain 

RY1073 prevented plaque formation by non-recombinant 
phage. The 14 kb jc48 EcoRI fragment was cloned into the 
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EcoRI site of AEMBL4, which had been cleaved with BamHI 
to utilize the "biochemical selection" for recombinants. 
The breakpoint fragments of xli and the recipient 
fragment were cloned into ASE6ABam. Libraries were 
packaged using A in vitro packaging extracts prepared as 
described in Hohn (Hohn, B., 1979. Methods Enzvmol. 
68:299-303), After demonstration that each of the 
libraries gave a significant number of plaques only when 
inserts were included in the ligation, they were screened 
using restriction fragments capable of detecting the 
breakpoint clones. 

Gamma irav mutagenesis 

Adult males of the strain rv, h W sbd 2 £u 2 or st in ri 
£ p sM 2 were irradiated in plastic vials with 5000 rad of 
gamma rays from a Cs 137 source at a dose rate of 4300 
rad/minute. These were then mated to virgins of the 
appropriate strain, which were allowed to lay eggs for 
five days. 



EMS mutagenesis 

The primary lesion in EMS-induced mutations of 
bacteria and yeast is an alkylation-induced transition of 
guanine to adenine; most EMS-induced point mutations in 
Drosopfrjla can similarly be explained on this basis. 
This change would be expected to convert, on the 
complementary strand, a C in the opa repeat element to a 
T, creating an in-frame stop codon (CAGCAA to UAGCAA or 
CAGUAA) . (Ethylnitrosourea, ENU, which has been reported 
to yield a higher number of mutations for a given amount 
of sterility, is also an alkylator; however, considerably 
more stringent precautions must be taken in handling this 
mutagen. ) 

EMS was administered at 0.025 M to unstarved 1.5-5 
day-old males in 1% sucrose solution (1.5 ml on two slips 
of Whatman #2 in a 350 ml milk bottle) . Starvation of 
the males for 8 hours before EMS administration resulted 



20 
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in unacceptable levels of sterility, and males of the st 
£ p eil strain readily fed upon the EMS/sucrose solution 
without starvation. Mutagenesis was monitored by 
crossing mutagenized males to attached-X FMA3 females, 
5 Other mutants seen in this screen included a large number 

of c& alleles (many mosaic) seen over TM6B in the Fl and 
F2 generations, a dominant brown allele, and two new 
mutants, Einli, a third chromosome dominant mutation 
resembling gar , and a third chromosome dominant Curly- 

10 like mutation. Wink is easily scored (RK1) , has complete 

penetrance, and is quite healthy over TM6B. 

In the initial screen, vials were scored as mutant 
if they had fewer than 25% as many deficiency 
heterozygote as balancer heterozygote flies. On 

15 retesting, this was revised to 50% of the level seen in 

control crosses. Balancer heterozygotes were 
approximately two-thirds as viable as deficiency 
heterozygotes. 

20 Tn situ hy hiHrf Nation and evtoloaical analyses 

111 situ hybridization of polytene chromosomes was 
carried out as described in Experimental Example I (see 
Methods, section A) . Cytological analysis was performed 
. by squashing larval salivary glands in lactoacetic orcein 
25 (2% orcein, 50% acetic acid, 30% lactic acid) . 

Although the present invention has been described in 
some detail by way of illustration and example for 
purposes of clarity of understanding, it will be obvious 
that certain changes and modifications may be practiced 
30 within the scope of the claims. 



WO 91/13167 



— " i ^'T M rTmTmyirrTTn^iMiiwiiLjiK. I jii m i ij Ul j l 



u 

Mill 1 1 1 1 ■ I llllllllll III III Mill I III! 



■ Mill Ullll IM llll 111,1 



•at 



aUOOUMMXWU 



> nm i wiii i iu. m i »nm i mmT iii m m hia a aha 



i 



Ml *" 

penauc *a act ctt tx acr tm im ao cm am ax am en ox 
^Avntmin »» fla « f w me m *u i« il, S 2 S S 

IM 

AMtxtxctcAMencxcMAxcxgn Ate 

222222222222222522 

our am An tat eoa ac oat tax cm cm as am cm #v «•_. _ 

*na 1--TTCTTTXM 



Am tf «t» n.^ai r tte a. »i lm cr, 5 55 

j»**AaTAj«r. .latma t« JtfCGTtTCM 

tx osr ttc cat tie as on cat tx toe cm oar we am am tit m? »• 

i« aiy «i. nrr ai F »u «u cf ota ci f 55 Si S 5 2 5 

no 

am ta* u. Ota au i,. i w au ryr am *» S5 5 t£ HI S£ SS £ 



WATremarwiwoKwcariarMwtaeoKcioauua 



t.-CSAAtTOCM 



l Ml 

CM AM OQB CCT AtC CIS OB OX MO CM CM MS Att £*a j» .■ 

CI. L- AU Am Ita LP, AU AU «T St S 15 S 32 HI 5 SS 

cm cca ax ere ac ax cm cm aw ac cm cca oe nr nr* 
61. A- AU ta. Ata TAr 01. LP. a- g 52 £ S 2 S S S 5? 

cn etc as cm ere cm acc tar cm ne *^ *m ****•• — 

C Jaactctca. amrwi <.ja ncgac 



92 PCT/US91/01189 

ca era am ece acs ccr cm cxo cm tea cm cm cm ttc tea cm ccr ttt 
f n lm am rn au up aim im cu *»< eta au ay n» a«r gu am fm 

:«z 

cxcMCTAArrcxaxcnArecMmxeoxAnATrcxcxTTCCM 
Ai« au> vu ita am air vu ita am> top- au ciy ict tu rn «r ft» cu 

cnaCKEaaacoriMneuicieeiDiAiaQsaaeKfTcacsc 
^UMm6i*ta*»ifii-»iuni«i l ,»ua f i«ft.»i»M» 

CTO TTT C1C CSC CW ATC TOC ATC TTT CAC TX TCT ATA AAC TEA ATC ATC TCT 

L»i»p»mA«iu»Upey«i«rii»Ap*»«»A«u # A»i 1-r upu»cr» 
en aat ok cm an An cca cos cat as atc cm am osa ace ur ax ax 

UM AM Olf OU »4i »« ABJ AM A» AU UP CU *AA «f Ai» AlP Al« AfW 

ttc en an oe tec acc nc ait ttc cca cm ex *ra am tcb ato aac crs 
fptevutaMnrfptamuictauiPitaMicrtou. 

ntx 

aca gat xe cMA&axentTeTxaxAicatTcn ait ago cos cat ox 
TteApiu8uuiUruiMc»fiuaivutMtt«ikrrfiMUf 

ccc oar na ex am era cm cn aic cm am ass tac tea cca etc am ax 
rnaiTiflmtaiMOliiMUaGiiinientiuiqiMtrasir 

MO 

toe cn cm nc att ere cx cm aat ax cx gat cm ex gm ttc era ax 

C^UwCOATyrltaVAlAUCUAA-AoffwA.GLArCTGX. rt»u.u. 

am no en cm aco ato cx gat en cx ax cn ms ax cn cm acc cm 
^iMtoiaurAr«iiiipju«it|o«i«aunriMtt<araa 

am en on at? tre cos ms cm cm am gm en cn ox cm cm atc tos 
u*LMVAiVAimAMrwu«AuimataUuta«Ac<puaiafcrTTp 

IM 

zia 

TX AM CM OM OX AM AM AX CAT OX CM CM AM AM TOO CX tCB CX 
fx HXT Ota Ajp 4tT Am mm tar ap> 01? Ota Ota Am Lf ■ m* trm aar ciy 

ax too cca gat ax An cm era cm kg ox am act cca err ax Tea cr a 

tac Tip AU AM AU MXT Ajp VaL Gta AU AU LfS tftf-fn Lm Glf *•< V*l 

tea ms act cm tee ax cm cn gm tm ox act ox ax act tea cm xa 

cm ox an ter cn ex tea as ccr cm cm cm ax tea act cn ax ax 
au air viite iMMHtfnfnGuaicumiif iuiM&uiH 

tea ccr car en en aoo ax ax etc tx xa ax nr cx era cx am an 

2433 

axJUtfrxftXTKMeaBreMtxacAocAacTCMAncATAnarTax 

AUAMAMOIyAMlMOlyAMfMCtaAUAUataintaPltaVslCly 

702 

TOS CM CCA CAT CTC AX CM AM CX TO ACA ATC M9 CCS ATT ON XA CM 

i %.. 

CM CM CM CM CM CM CM CM CM ATC OCA ATA CTC MT AAT OB CAT TX 

ciAGtaataauGtaGtaGUGtaGtattaCirUPUNiAMAmAUMtatar 

CX AM TO AAT GS9 OSA CM GCff An TX CM CM CM CM CM CM CCA CM 
AMAMUwAMttTOlrauAUlOTCMGtaGtaGtaataGUAUffCOaU 

9CM119I 

CnCMOCCMtnACAOXXAOSfaXCXTMMAIUUlCrAGATTCaCX 

LwiuiuiuiMmAUClyAUiUuirfrAittniMAifMfrm 
zm - 

AOS GAT ICO OK ACT OM TOS OK AM CM AM AM CM tOC MS XS CTO AST 
ttx Am |«c Gta Cl» *»» Oly Am Ota tas Am Ota Cf« If ■ Ala Val Sac 

TCB COB OCA ACT tOS TCB TX ICC AST CCB CCT tCS MT On OAT CAT 00S CSC 
Am Oly GZr A«c A«< sw qr* sm la* res am am tar mi Am am Ala Im 

<10 

GM TX AX CAT OX GX OX AAT CM AAT CM QO KO CM CAT CX CM CTO 
Am Cm Am Am AU AU AU AM AU Am Ota tfal VAl Gta Mia fn Cta Law 

AST CTO GTS TCS Sn TCA CCA CTT CX TCS CX CM OX ICC AX AX AX Cftt 

tac vai fii kt m tat rn m art m r«> cu rn Mr tu m mi iu 

en AM CCA CM ATT On GM GAT An CX CTC CTO AM OX CTC CTO CM OCT 
tau ta» Acq Gta tta Val Ola AM nr fto Vai Lm Laa An Vil Lm Gl* Ala 

M4 

2ff72 

cx oer en tx cat ax am tx ctc *n gm cm ox tm am cx cm am 

fra Pto Lm Tyr Am fbr AM taf Lm MET Am Gta AU Tyr Lya Fro KLa Lft 

am ttc en ox cn cx cat cx cm ttc cm ax ax cm ox oat ok ax 

Lm TAp Aca) AU Lm Acs; AU Acq Gta fM Gta T&f AU Qu Ala **• Ala tar 

MT TX ACT TX OX tOS AM AX CTO ACT OX CX MT CCS CX CM AX XA 
tec Am Tkc tac Glr tac Am Aat Lm to* Ata Gly tac tra Ac* CU tar Fro 

111 

ore xs AMACTanaxAxaxasaaTxaasaxAxaxaxxAacr 
vai Fra Am tar val Ala tbf fn Fr» tra Aa# au au tar AU AU AU Gly 

AAT CCC OX CM AX CM CTO CM An CM Cn ACS CX AX AX CX AM OX 
MA CTT Cn GX TCT AM m ill GU Mf Oli Uj 111 «T tU LM TU U| IN Ur fn Iff ill 
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ACTT^TACTItAAAMCATtUT^^ 

CTTATGTTAAAAAAAnVGGTUAGArTGTAACCG 

-141 

AcraTTramaoc&a^^ 

AAAACTCfcAC&^TCaXrcX^ 

^ mRNA atarr aito 

+ 211 

. .TTAT^CJTGTXAAGAAAAC^Sn^ 

GAAGGATAAAATAATATCOCACCACrrOCTT^AAAACCAAM 

AAATTTTTTTAATAAATATT^^ 
380 

M£~ Lou MFT of^ **** ^* ^*C 

ME* Lou MET Sor Ala Asp Sor Sor Asp Sor Ala Lys Thr Sor Val ilo cy 3 sor 

! 

f!?* 5?^? GTA GCA GCA CCA GCT CQ GAA CAG CCC AOC ACC ACA 

.hr Vax Sor Ala Sor MET Lou Ala Pro Pro Ala Pro Glu Gin Pro Sor Thr Thr 

CCA CCA CCC ATT TTC GGG GTA ACA GGT CGA TCT CAC CTG GAA AAT GCC CTG AAA 
Aia Pro Pro lio Lou Giy Val Thr Gly Art? Sor Hla Lou Glu Aan Ala Lou Lya 

542 54 

^ A 5^ GCT ™G GAG GAC *AC AAG CTG OGC 

Lou Pro Pro Aan Thr Sox Val Sor Ala Tyr Tyr Gin Hi a Asn Sor Lys Lou Gly 

GOG GAA AGG AGC CTG GTA OCA CCT GTC PCA GAT 
M_* Gly Gin Aan Tyr Asn Pro Glu Pho Any Sor Lou Val Ala Pro Val Thr Asp 

CTG GAT ACT GTG CCA CCC ACA GGT GTG ACC ATG GCG AGT TCT TCG AAT TCT CCC 
Asp Thr Val Pro Pro Thr Gly Val Thr MET Ala Ser Sor Sor Aan Sor Pro 

106 

Aan Sor Sor Val Lys Lou Pro Hla Sor Gly Val lio Pho Val Sor Lys Sor Sor 

TP JPT A0C ACC ACC ^ ^ ^ ACT ^ 073 77X3 GAA GAG GAG GCG CAG 
Ala Val Sor Thr Thr Asp Gly Pro Thr Ala Val Lou Gin Gin Gin Gin Pro Gin 

812 

^ ^5 ^ 00 CAC ?:C ^ — CTO C CC OC CAC CAC CCC CAG CAG GAA 

win Gin MET Pro Gin His Pho Glu Sor Lou Pro His Kia Hia Pro Gin Gin Glu 
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"AG CAA ATG CCC CAG CAC TTC GAG TCC CTG CCC CAC CAC CAC CCC CAG CAG GAA 
Gin Gin MET Pro Gin His Pho Glu Sar Lou Pro His His Kis Pro Gin Gin Glu 

CAC CAG CCA CAG CAG CAG CAG CAA CAA CAT CAC CTT CAG CAC CAC CCA CAT CCA 
His Gin Pro Gin Gin Gin Gin Gin Gin His His Uu Gin His His Pro His Pro 



CAT GTG ATG TAT COG CAC GGA TAT CAG CAG GCC AAT CTG CAC CAC TCG GGT GGT 
His Val MET Tyr Pro His Gly Tyr Gin Gin Ala Asn Lou His His Sar Gly Gly 

ATT GCT GTG GTT COG GOG GAT TCG CGT CCC CAG ACT CCC GAG TAC ATC AAG TCC 
Ila Ala val Val Pro Ala Asp Sar Any Pro Gin Thr Pro Glu Tyr 11a Lys Sar 



TAC CCA GTT ATG GAT ACA ACT GTG GCT AGT TCG GTA AAG GGG GAA CCA GAA CTC 
Tyr Pro Val MET Asp Thr Thr Val Ala Sar Ser Val Lys Gly Glu Pro Glu Lau 



QTGAGTTGTG. .lntron 1 . ■ TTCTTTGCgg 



1082 
AAC ATA GAA 
Asn I la Glu 



TOC GGT TTC 
Sar Gly Pho 



TTC GAT GGC AOC ACA GTG CTG TGC COC GTT TGC GGG GAT AAG GCC 
Pha Asp Gly Thr Thr Val Lau Cys Arq Val Cys Gly Asp Lys Ala 



GTAAGTTCGT. .lntron 2. .Ai^m c^p 



CAT TAC GGC GTG CAT TCC TGC GAG GGT TGC AAG OGA TTC TTC CCC 
His Tyr Gly Val His Sar Cys Glu Gly Cys Lys Gly Pha Pha Arq 



CCC TCC ATC CAG CAA AAG ATC CAG TAT CGC CCG TGC ACC AAG AAT CAG CAG TGC 
Arq Sar tla Gin Gin Lys Ila Gin Tyr Arq Pro Cys Thr Lys Asn Gin Gin cys 



AGC ATT CTG CCC ATC AAT CGC AAT CGT TGT CAA TAT TGC CDC CTG AAA AAG TGC 
Sar Ila Lau Arq lia Asn Arq Asn Arq Cys Gin Tyr Cys Arq Lau Lys Lys Cys 

GTGAGTACCT. . intron 3 . . CCAATTGCAG 



ATT CCC GTG GCC ATG AGT CGC GAT GCT GTG CGT TTT GGA CGC GTG CCG AAG CGC 
Ila Ala Val Gly MET Sar Arq Asp Ala Val Arq Pha G.y Arq Val Pro Lys Arq 

g!a 2 AAG GCG CGT ATC CTG GCG GCC ATG CAA CAG ACC ACC CAG AAT CGC GGC CAG 
Zlu Lys Ala Arq Ila Lau Ala Ala MET Gin Gin Sar Thr Gin Asn Arq Gly Gin 



c«ca<»:CT50CJia:(MCTG»TWCKcacKcxp:««^ 
-in Arq Ala Lau Ala Thr Glu Lau Asp Asp Gin Pro Arq Lau Lau Ala Aia vax 

CTG CGC GCC CAC CTC GAG ACC TGT GAG TTC ACC AAG GAG AAG GTC TCG GCG ATG 
Lau Arq Ala His Lou Glu Thr Cys Glu Pha Thr Lys Glu Lys Val Sar Aia nti 



GTAAGTCTCA. . intron 4 . .ATTTCjgg 




CCG CAG CCG GCG CCG GAT TCC CCC TCC TAC TCC ATG CCC ACA CTT CTG GCC TGT 
Arq Gin Arq Ala Arq Asp Cys Pro Sar Tyr Sar MET Pro Thr ^u Lau Ala 
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2432 

GCC AAT TCC GOC TCC AOC GGT GAC TCC CGA CCA GCT GAG ATG GAT ATC CTT CGC 
Ala Asn Sor Gly Sor Sor Gly Asp Sor Giy Ala Ala Giu MET Asp Ilo Vai Gly 

702 

TOG CAC GCA CAT CTC ACT CAG AAC GOG CTC ACA ATC ACG OCG ATT GTG CGA CAC 
Sor His Ala His Lou Thr Gin Asn Gly Lou Thr Ilo Thr Pro Ilo Vai Arq His 

QJAGTATCTT. .lntron 5 . . TTTCTTAg ^G 

Y= 

CAG CAG CAG CAA CAA CAG CAG CAG CAG ATC GGA ATA CTC AAT AAT GCG CAT TCC 
Gin Gin Gin Gin Gin Gin Gin Gin Gin Ilo Gly Ilo Leu Asn Asn Ala Ris Sor 

CGC AAC TTC AAT GGG GGA CAC GCG ATG TGC CAG CAA CAG CAG CAG CAC CCA CAA 
Arq Asn Lou Asn Gly Gly His Ala MET Cys Gin Gin Gin Gin Gin His Pro Gin 

75 6 

G(Dro4 925) 

CTC CAC CAC CAC TTG ACA GOC GGA GCT GOC CGC TAC AGA AAG CTA GAT TCG OCC 
Lou His His His Lou Thr Ala Gly Ala Ala Arq Tyr Arq Lys Uu Asp Sor Pro 

Arq 

2702 - 

AOG GAT TCG GGC ATT GAG TOG GGC AAC GAG AAG AAC GAG TGC AAG OCG GTG AGT 
Thr Asp Sor Gly Ilo Glu Sor Gly Asn Giu Lys Asn Giu Cys Lys Ala Vai Sor 

TOG GGG GGA AGT TOC TOG TGC TOC AGT COG CGT TCC AGT GTG GAT GAT GCG CTC 
Sor Gly Gly Sor Sor Sor Cys Sor Sor Pro Arq Sor Sor Vai Asp Asp Ala Lou 

810 

GAC TOC AOC GAT GOC GOC GOC AAT CAC AAT CAG GTG GTG CAG CAT CCG CAG CTG 
Asp Cys Sor Asp Ala Ala Ala Asn His Asn Gin Vai Vai Cln His Pro Gin i*u 

f 

AGT GTG GTC TCC GTG TCA CCA GTT COC TCG CGC CAG CCC TCC ACC AGC AGC CAT 
Sor Vai Vai Sor Vai Sor Pro Vai Arq Sor Pro Gin Pro Sor Thr Sor Sor His 



CTG AAG CGA CAG ATT GTC GAG GAT ATG COC GTG CTC AAG CCC GTG CTG CAG OCT 
Lou Lys Arq Gin Ilo Vai Glu Asp MET Pro Vai Lou Lys Arq Vai Lau Gin Ala 

864 

2972 

COC CCT CTC TAC GAT ACC AAC TOG CTG ATG GAC GAG OCC TAC AAG CCG CAC AAG 
Pro Pro Lou Tyr Asp Thr Asn Sor Lou MET Asp Glu Ala iyr lys Pro His Lys 

AAA TTC COG GOC CTG COG CAT COC GAG TTC GAG ACC OCC GAG OCG GAT OCC AGC 
Lys ?ho Acq Ala Lou Arq His Arq Glu ?ho Glu Thr Ala Glu Ala Asp Ala Sor 

AGT TOC ACT TOC GQC TOG AAC AGC CTC AGT GOC COC AGT OCG CGC CAG AGC CCA 
Sor Sor Thr Sor Gly Sor Asn Sor Lou Sor Ala Gly Soc Pro Arq Gin sor . ro 

9i 8 

GTC CCG AAC AGT GTG OCC AOG COC COG CCA TCG GOG GOC AGC GCC GCC GCA CGT 
Vai Pro Asn Sor Vai Ala Thr Pro Pro Pro Sor Ala Ala Sor Ala Ala Ala o A y 

AAT OCC OCC CAG AGC CAG CTC CAC ATG CAC CTC AOC CGC AOC AGC COC AAG GCC 
Asn Pro Ala Gin Sor Gin Lou His MET His Lou Thr Arq Sor Sor Pro uys Aia 




WO 91/13167 y « PCT/US91/01189 

3242 

TCG ATG GCC AGC TCG CAC TCG GTC CTG Cw AAG TCT CTC ATG GCC GAG COG CGC 

3ar MET Ma Ser Ser His Ser Val Leu Aia Lys Ser Leu MET Ala Glu Pro Are 

972 

ATG ACG CGC GAG CAG ATG AAG CGC AGC GAT ATT ATC CAA AAC TAG TTG AAG CGC 

MET Thr Pro Glu Gin MET Lys Arg Sor Asp lie II© Gin Asn Tyr Lou Lys Arg 



GAG AAC AGC ACA GCA GCC AQC AGC ACC ACC AAT GGC GTC GGC AAC CGC AGT CCC 
Glu Asn Sor Thr Ala Ala Ser Ser Thr Thr Asn Cly Val Giy Asn An? Ser Pre 



AGC AGC AGC TCC ACA CCG CCC CCG TCG GCG CTC CAG AAT CAG CAG CCT TGG GGC 
Sor Sor Sor Sor Thr Pro Pro Pro Ser Ala Val Gin Asn Gin Gin Arq Tro Giy 

1026 

AGC AGC TCG GTG ATC ACC ACC ACC TGC CAG CAG CGC CAG CAG TCC GTG TCG CCG 
Sor Sor Sor Val lie Thr Thr Thx Cys Gin Gin Arg Gin Gin Sor Val Sor Pre 

3512 

CAC AGC AAC GGT TCC AGC-TCC AGT TCG AGC TCT AGC TCC AGC TCC ACT TOG TCA 
His Sor Asn Giy Sor Sor Sor Ser Sor Sor Sor Sor Ser Ser Sor Sor Ser Ser 



TCC TCC TCC ACA TCC TCC AAC TOC AGC TCC AGC TCG GOC AQC AGC TGC CAG TAT 
Sor Sor Sor Thr Sor Sor Asn Cys Sor Sor Sor Sor Ala Ser Sor Cys Gin Tyr 

108D 

TTC CAG TCG COG CAC TCC ACC AGC AAC GGC ACC AGT GCA CCG GCG AGC TCC AGT 
Pho Gin Sor Pro Els Sor Thr Sor Asn Giy Thr Sor Ala Pro Ala Ser Ser Ser 



TCG GSA $CG AAC AGC GCC ACG CCC CTG CTC GAA CTG CAG GTG GAC ATT GCT GAC 
Sor Giy Sor Asn Sor Ala Thr Pro Lou Lou- Glu Lou Gin Val Asp Ilo Ala Asp 



TCG GCG CAG OCT CTC AAT TTG TCC AAG AAA TCG CCC ACG CCG CCG CCC AOC AAG 
Ser Ala Gin Pro Leu Asn Lou Sor Lys Lys Sor Pro Thr Pro Pro Pro Ser Lys 

1134 

3782 

CTG CAC OCT CTG GTG GCC GCC GCC AAT GCC GTT CAA AGG TAT CCC ACA TTG TCC 
Lau His Ala Leu Val Ala Ala Ala Asn Ala Val Gin Arg Tyr Pro Thr Leu Ser 



GCC GAC CTC ACA GTG ACA GOC TCC AAT GGC GOG TCC TCC GTC GGC GGC GGC GAG 
Ala Asp Val Thx Val Thr Ala Ser Asn Giy Giy Sor Sor Val Giy Giy Giy Glu 



TCC GGC OGC CAG CAG CAG TCC GCC GGC GAG TCT GOG CTC CCC CAA TCC GGG CCT 
Ser Giy Arg Gin Gin Gin Sor Ala Giy Glu Cys Giy Lou Pro Gin Ser Giy Pro 

1186 

GAG CGC CGC CCT GCA CAA GGT AAT GCT GGA GGC GTA AGA GOG GGA GGA GGT AGG 
Glu Arg Arq Arg Ala Gin Cly Asn Ala Giy Giy Val Arg Ala Giy Giy Giy Arg 



TGG TTT TAC GCG GAG AAG TGG GAG AGA CAG AGA CTC GGA GTG GCA GTT CAG CGA 
Trp Pho Tyr Ala Glu Lys Trp Glu Arg Gin Arg Lou Giy Val Ala Val Gin Arg 

4052 

AGC AGG AAG CAG GAT CAC TTC GAG CGG CGG GAG TTC AAT TAA 
Sor Arg Lys Gin Asp Ris Lou Glu Arg Arg Glu Lou Asn 

1237 



Prtntad from MlmoM 



99 

WO 91/13167 PCT/US91/01189 

AGC 2 AGG AAG CAG GAT CAC TTG GAG CCC COG GAG TTG AAT TAA 
Sar An? Lya Gin Asp His Lou Glu Art? Art* Giu Lau Asn^ . 

ATTATTTTACCATTTAATTGAGACGTGTACAAACTTTCiAAAG 

TAATATlTAAAGCAACAACAAACAAAACAACrACAAGTTATTAAm 

AACAAAAAACCCAAGCTTC^TGGTATTACAAMGAAAAAG^ 

XAGTTAAACTTTAACCTAGCAAGAAAC^ 

^GC^ACCG^OGCCCC^^ 

CGACOerTGAITCTTTrATAAGTTTTAAU.ll. I'iUl T G T A CATATTAATTACGTTTATTGGTAACTATGT 

^AGCGCTTTACTTCTAGTTGGAGCAAAACTAC^ 

TATTATTAiaAATTTTTAAATACCTAAAAACAAAACAATGTC^^ 

| ^ poly A aita cOn*927 and c)>b4928 

AAGCAGAATGAAGTGCACTTTGCAACAAAOT 

ATTTAACTjjATTTTATCAATTTCTTATAGTTTTrc 

ScTAlTXTrCATTTCTCCTTTaGCACAAAa 

^TTTTCTAATTTTAAAACTACC^^ 

A^GCTTAAAGTTTTAGCAGCAACCAGTAAAACGACX^rc 

rr<rCTACATAGTTAAATC7rTAAATTAAACACAAAAACATATTTAA^ 

^MT^GAAACXTATCCTTAAAACATTC^ 

AAAAAAAAATCAAOVAAC^AAAATTATGATATATTATTTAAAAG^ 

AACAAAAGAGAAAACTTGCCGTAGCAACAAMCTAT^^ 

ATGATTATTAATTATT ATGA TTAATTAATTACGATTTTTATGCTTAGACAAACCAACAAAAAACAAATAT 

nfJUVAAA OCATTAAAA AAAAAAACAAAAAACAAGCAAAAAAT 

.putativB poly««"yl* cton al, » nal icC l0nq cranscxlp " 
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B 

n"ACC^XGGGATCXXXSTAATCrXX^^ 
iwS^GCOGAACCAACACAACAC^^ 



fcjji .'^."."rrcA vi^oCAAA * I w o " l Lot 1 oUC AGCGG". ' 

I 

1 ^ mRNA scan sit© 

TGAGATC^GTGAAGTGTGCAAI2TTCCTTA^ 

284 

CTCTCATTTCGAAGCCAAAAAGCAAAATCTCTAATTCAAAT ATG G?7 TCT GCA ATG CAA 

KET Val Cys Ala MET C-r. 

302 

GAG G7T GCT GCT GTG CAG CAT CAG CAG CAG CAA CAG CAA CTC CAG 77G CCC C\G 
Ciu Val Ala Ala Val Gin His Gin Gin Gin Gin Gin Gin Lau Gin Lau Pro Glr. 

24 

CAG CAA CAfc CAG CAG CAG CAG ACA ACA CAG CAG CAA CAT OCA ACA ACG ATA GTG 
Gin Gin Gin Gin Gin Gin Gin Thr Thr Gin Gin Gin His Ala Thr Thr lie Val 

CTC CTG ACG GGC AAT GGC GGC GGT AAT CTG CAC ATT GTC OCC ACA CCG CAA CAG 
Leu Lou Thr Giy Asn Gly Gly Gly Asn Lau His lie Val Ala Thr Pro Gir. Ci-r. 

CAT CAG CCG ATG CAT CAG CTC CAC CAT CAG CAT CAG CAT CAG CAT CAG CAC CAC 
His Gin Pro KET His Gin Leu His His Gin His Gin His Gin his Gin His Glr. 

78 

CAG CAG GCC AAG AOC CAA CAG CTC AAG CAA CAA CAC TCG GCC CTG CTC AAG T7C 
Gir. Gin Ala Lys Sar Gin Gin Lou Lys Gin Gin His Sar Ala Leu Val Lys Lou 

572 

CTG GAG TCG GCG CCC ATC AAG CAG CAA CAG CAG ACG CCC AAG CAA ATT CTT TAC 
Leu Giu Sar Ala Pro lie Lys Gin Gin Gin Gin Thr Pro Lys Gin lie Vai Tyr 



CTG CAG CAG CAG CAG CAG CAA CCG CAA CCC AAA AGA CTG AAA AAC GAA OCA CCA 
Lau Gin Gin Gin Gin Gin Gin Pro Gin Arg Lys Arq Lau Lys Asn Glu Ala Aia 

132 

ATC GTA CAA CAG CAA CAA CAA ACA CCT GCA ACA CTA GTA AAG ACA ACA ACC ACC 
Ila Val Gin Gin Gin Gin Gin Thr Pro Ala Thr Lau Val Lys Thr Thr Chr Thr 



AOC AAC AOC AAC AOC AAC AAC ACC CAG ACA ACA AAT ACT ATT AGT CAG CAG CAA 
Sar Asn Ser Asn Sor Asn Asn Thr Gin Thr Thr Asn Sor I.o Sar Gin Gin Glr. 
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AOC AAC AOC AAC AOC AAC AAC ACC CAG ACA ACA AAT ACT ATT ACT CAG CAG CAA 
Sar Asn Sar Asn Sar Asn Asn Thr Gin Thr Thr Asn Sar lie Sar Gin Gin Gin 



CAG CAG CAT CAG ATT CTG ?T0 CAC CAC CAG CAG CCA GCC GCG OCA OCA ACA CCA 
Gin Gin Ki3 Gin lie vMi Lau Gin His Gin Gin ?rc Ala-Aia Ala Ala Thr Pro 

186 

842 

AAG CCA TGT GOC GAT CTG AGC GCC AAA AAT GAC AOC GAG TCC GGC ATC GAC GAG 
Lys Pro Cys Ala Asp Lau Sar Ala Lys Asn Asp Sar Giu Sar Giy lie Asp Giu 



GAC TOC CCC AAC AOC GAT GAG GAT TOC CCC AAT GOC AAC CCG OCG GOC ACA TCG 
Asp Cys Pro Asn Sar Asp Glu Asp Cys Pro Asn Ala Asn Pro Ala Gly Thr Ser 



CTC GAG GAC AGC AGC TAC GAG CAG TAT CAG TOC CCC TGG AAG AAG ATA CCC TAT 
Leu Glu Asp Sar Sar Tyr Glu Gin Tyr Gin Cys Pro Trp Lys Lys lie Arg Tyr 

240 

GOG CGT GAG CTC CTC AAG CAG COC GAG TTG GAG CAG CAG CAG ACC ACC CGA GGC 
Ala Arq Glu Lau Lau Lys Gin Art? Glu Lou Giu Gin Gin Gin Thr Thr Giy Giy 



AOC AAC GCC CAG CAG CAA CTC GAG GCG AAG CCA GCT GCA ATA CCC ACC AGC AAC 
Sar Asn Ala Gin Gin Gin Val Glu Ala Lys Pro Ala Ala lie Pro Thr Sar Asn 

1112 

ATC AAG CAG CTG CAC TGT GAT AGT CCC TTT TOG GOG CAG ACC CAC AAG GAA ATC 
Ila Lys Gin Lau His Cys Asp Sar Pro Pho Sar Ala Gin Thr His Lys Giu lia 



GCC AAT CT^i CTG CCC CAA CAG TCC CAG CAA CAA CAG GTT GTG GCC ACG CAG CAG 
Ala Asn Lau Lau Arq Gin Gin Sar Gin Gin Gin Gin Vai Val Ala Thr Gin ^n 



CAG CAG CAA CAG CAG CAG CAG CAC CAG CAC CAG CAA CAA CGA AGG GAT AGC TCC 
Gin Gin Gin Gin Gin Gin Gin His Gin His Gin Gin Gin Arq Arq Asp Sar Sar 

CAC AOC AAC TOC TOG CTG ATG AOC AAC TCG AOC AAC TCC AGT OCG OGC AAT TGT 
Asp Sar Asn Cys Sar Lau K£T Sar Asn Sar Sar Asn Sar Sar Ala Giy Asn Cys 

TOC AOC TOC AAC GCT GOC GAC GAC CAG CAG CTG GAG GAG ATG GAC GAG GCC CAC 
Cys Thr Cys Asn Ala Gly Asp Asp Gin Gin Lou Giu Glu MET Asp Giu Ala nis 

1382 

GAT TCG GCC TOC CAC GAT GAA CTT TOC GAG CAG CAT CAC CAG CGA CTG GAC ^ 
Asp Sar Giy Cys Asp Asp Glu Lau Cys Glu Gin His His Gin Axg -au Asp sar 



TCC CAA 
Sar Gin 



CTG AAT TAC CTG TOC CAG AAG TTC GAT GAG AAA CTG GAC ACG GC*j CTG 
Lau Asn Tyr Lau Cys Gin Lys Pha Asp Glu Lys i^u Asp Thr Ala ^u 

AOC AAC AOC AOC GCC AAC ACG GOG AOG AAC ACC CCA OCT GTA ACA OCT AAC GAA 
Ser Asn Sar Sar An Asn Thr Gly Arq Asn Thr Pro Ala Val Thr Ala Asn Giu 



1544 

nAT COC CAT qxaqqtttaq 
Asp Ala Asp 
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WHAT IS CLAIMED *IS : 
lm An isolated recombinant nucleic acid which, 

upon expression, is capable of coding for other than a 
native vertebrate steroid receptor or fragment thereof, 
said nucleic acid comprising a segment having a sequence 
substantially homologous to a coding region of domains A, 
B, D, E or F from an insect steroid receptor superfamily 
member gene having substantial homology to a steroid 
binding domain. 

2. An isolated recombinant nucleic acid of 

Claim 1, wherein said insect steroid receptor superfamily 
member is EcR, DHR3, E75A or E75B. 

3 . An isolated recombinant nucleic acid of 
Claim 1, wherein said nucleic acid encodes a polypeptide 
capable of binding to a ligand for an insect steroid 
receptor superfamily member. 

4. An isolated recombinant nucleic acid of 
Claim 1, wherein said nucleic acid is capable of 
hybridizing to an insect steroid receptor superfamily 
member gine segment under selective hybridization 
conditions. 

5. An isolated recombinant nucleic acid of 
Claim 4, wherein said selective hybridization conditions 
are stringent hybridization conditions. 

6. A cell transformed with an isolated recombinant 
nucleic acid of Claim 1. 

7. An isolated recombinant nucleic acid having a 
sequence exhibiting identify over 20 nucleotides of a 
coding segment of an insect steroid receptor superfamily 
member having steroid binding domain homology. 

8 - An isolated recombinant nucleic acid of 

Claim 7, wherein said nucleic acid encodes a polypeptide 
which binds to a control element responsive to a ligand 
of an insect steroid receptor superfamily. 

9. a cell transformed with an isolated recombinant 

nucleic acid, of Claim 7. 
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10. An isolated recombinant nucleic acid comprising 

a DNA sequence capable of binding to an insect steroid 
receptor superfamily member other than 20-OH ecdysone 
receptor. 

5 11. An isolated recombinant nucleic acid of 

Claim 10, wherein said insect steroid receptor 
superfamily member is DHR3, E75A or E75B. 

12. An isolated recombinant nucleic acid of 
Claim 10, wherein said DNA sequence promotes 

10 transcription of an operably linked sequence in response 

to binding by said insect steroid receptor superfamily 
member. 

13. An isolated recombinant nucleic acid of 

Claim 10, wherein said DNA sequence is operably linked to 
15 a DNA sequence encoding a polypeptide. 

14. An isolated recombinant nucleic acid of 
Claim 10 , wherein said isolated recombinant nucleic acid 
is an expression vector. 

15. x a cell transformed with an isolated recombinant 
20 nucleic acid of Claim 10. 

16. A cell of Claim 15, wherein said cell also 
comprises said insect steroid receptor superfamily 
member. 

17. a recombinant nucleic acid comprising: 

25 a control element responsive to a ligand of an 

insect steroid receptor superfamily member 
ligand responsive control element; 
a non-heat shock promoter sequence; and 
a sequence comprising a reporter gene. 
30 18. A recombinant nucleic acid of Claim 17, wherein 

said non-heat shock promoter sequence is an alcohol 
dehydrogenase promoter. 

19. A cell transformed with a recombinant nucleic 

acid of Claim 17. 
35 20. A cell of Claim 19, wherein said cell is a 

mammalian cell. 
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21. A recombinant nucleic acid of Claim 17 f 

comprising additional copies of control elements 
responsive to a ligand of an insect steroid receptor 
superfamily member. 
5 22. A method for monitoring expression of a 

reporter gene comprising the steps of: expressing a 
recombinant nucleic acid of Claim 17, wherein no 
radioactive reagent is used in said monitoring of said 
reporter gene. 

10 23. A polypeptide comprising an insect steroid 

receptor superfamily member or fragment thereof, wherein 
said polypeptide is substantially free of naturally- 
associated insect cell components and exhibits a 
biological activity characteristic of an insect steroid 

15 receptor superfamily member with a steroid binding 

domain. 

24. A polypeptide of , Claim 23, wherein said insect 

steroid receptor superfamily member is selected from the 
group consisting of EcR, DHR3, E75A and E75B. 
20 25. i A polypeptide of Claim 23, wherein said insect 

steroid receptor superfamily member or fragment thereof 
also comprises a DNA binding domain. 

26 - A polypeptide of Claim 23, wherein said insect 

is Drosophila melanogaster. 
25 27. A polypeptide of Claim 23, which is capable of 

binding to a hormone analogue selected from the group 
consisting of an insect hormone and an insect hormone 
agonist. 

28. A polypeptide of Claim 27, wherein said insect 
30 hormone is an ecdy steroid. 

29. A polypeptide of Claim 27, wherein said insect 
hormone is 20-OH ecdysone. 

30. A polypeptide of Claim 23, which is capable of 
binding to a DNA control element responsive to an insect 

35 hormone. 

31- A polypeptide of Claim 30, wherein said 

polypeptide comprises a zinc-finger domain. 
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32. A polypeptide of Claim 30, wherein said insect 

hormone responsive DNA control element is operably linked 
to a transcription unit which is responsive to said 
binding. 

5 33. a polypeptide of Claim 32, wherein said insect 

hormone responsive DNA control element is upstream from 
said transcription unit. 

34. a polypeptide of Claim 23 fused to a second 

polypeptide. 

10 35. A polypeptide of Claim 34, wherein said second 

polypeptide is a heterologous polypeptide. 
36. A polypeptide of Claim 35, wherein said 

heterologous polypeptide comprises a second steroid 
receptor superfamily member. 

15 37. A polypeptide of Claim 23, wherein said 

fragment has a sequence substantially homologous to a 
consensus El region sequence. 

38. A polypeptide of Claim 23, wherein said 
fragment has a sequence substantially homologous to a 

20 consensu^ E2 region sequence. 

39. A polypeptide of Claim 23, wherein said 
fragment has a sequence substantially homologous to a 
consensus E3 region sequence. 

40. A polypeptide of Claim 23, wherein said 
25 fragment has a sequence comprising: 

a segment at least about 25% homologous to a 

consensus El region sequence; 
a segment at least about 30% homologous to a 

consensus E2 region sequence; and 
30 a segment at least about 30% homologous to a 

consensus E3 region sequence. 

41. a composition of matter comprising a 
polypeptide of Claim 23. 

42. A cell comprising a polypeptide of Claim 23. 
35 43. A cell of Claim 42, wherein said cell is a 

human cell. 
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44. An antibody or binding fragment thereof 
exhibiting binding specificity to a polypeptide of 
Claim 23, wherein said binding specificity is directed to 
an epitope characteristic of an insect steroid receptor 

5 super family member, 

45. A method for selecting DNA sequences capable of 
being specifically bound by an insect steroid receptor 
superfamily member, said method comprising the steps of: 

screening DNA sequences for binding to a polypeptide 
10 of Claim 23; and 

selecting said DNA sequences exhibiting said 
binding. 

46. A method of Claim 45, wherein said DNA sequence 
is operably linked to a gene selected from the group 

15 consisting of EcR, DHR3, E74 and E75 genes. 

47. A method for selecting ligands specific for 
binding to a hormone binding domain of an insect steroid 
receptor superfamily member, said method comprising the 
steps of: 

20 screening compounds for binding to one or more 

superfamily members; and 
selecting compounds exhibiting specific binding to 
the members. 

48. A method of Claim 47, wherein said ligand is an 
25 ecdysteroid. 

49. A method of Claim 47, wherein said ligand is a 
20-OH ecdysone antagonist. 

50. A method for modulating insect physiology or 
development comprising the steps of: 

30 screening compounds for binding to an insect steroid 

receptor superfamily member; 
selecting said compounds exhibiting said binding; 
and 

administering to an insect said ligand. 
35 51. A method of Claim 50, wherein said modulating 

is lethal to said insect. 
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52. A fusion polypeptide comprising a hormone 
binding domain of an insect steroid receptor superfamily 
member and a second polypeptide. 

53. A fusion polypeptide of Claim 52, wherein said 
second polypeptide comprises a DNA binding domain from a 
second steroid receptor superfamily receptor member. 

54. A nucleic acid encoding a fusion polypeptide of 
Claim 52. 

55. A method for selecting ligands specific for 
binding to a ligand binding domain of an insect steroid 
receptor superfamily member, said method comprising the 
steps, of: 

combining: 

(i) a fusion polypeptide of Claim 52, wherein 

said fusion polypeptide comprises 
said ligand binding domain 
functionally linked to a DNA binding 
domain of a second steroid receptor 
j superfamily member; and 

(ii) a second nucleic acid sequence encoding a 

second polypeptide, wherein 
expression of said second nucleic 
acid sequence is responsive to 
binding by said DNA binding domain; 
and 

screening compounds for an activity of inducing 
expression of said second polypeptide; and 
selecting said compounds. 

56. A method of Claim 55, wherein said combining 
occurs within a cell. 

57. A method of Claim 56, wherein said combining 
step results from expression upon transformation of said 
cell with a nucleic acid encoding said fusion 
polypeptide. 

58. A method for modulating insect physiology 
comprising the step of: 
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administering to an insect a ligand selected by a 
method of Claim 55. 

59- A method of Claim 55, wherein said binding 

domain is selected from binding domains of insect steroid 
receptor superfamily members selected from the group 
consisting of EcR, DHR3, E75A and E75B. 
60. A method of Claim 58, wherein said ligand is 

lethal to said insect. 

61- An isolated receptor control element non- 

responsive to 20-OH ecdysone comprising a DNA segment 
capable of binding to an insect steroid receptor 
superfamily member and capable of controlling expression 
of an operably linked gene. 

62. An isolated receptor control element of 
Claim 61, wherein said operably linked gene is within 
about 50 kb of said control element. 

63. An isolated receptor control element of 
Claim 61, wherein said insect steroid receptor 
superfamily member is DHR3, E75A or E75B. 

64. a method for producing a polypeptide comprising 
the steps of: 

selecting a cell which is substantially insensitive 
to exposure to an insect steroid receptor 
superfamily ligand; 
introducing into said cell: 

(i) a receptor for said ligand; and 
(ii) a nucleic acid sequence encoding said 

polypeptide, said nucleic acid 
sequence operably linked to a 
control element responsive to 
presence of said selected 
ligand, wherein a transformed 
cell is produced; and 
exposing said transformed cell to said ligand. 

65. A method of Claim 64, wherein said cell is a 
mammalian cell. 
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66. A method of Claim 64, further comprising the 
step of introducing said cell into an intact organism. 

67. A method of Claim 64, wherein said cell is a 
plant cell. 

68. A method of Claim 64 , wherein said ligand is 
20 -OH ecdysone. 
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